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TITLE 

SUPPRESSION OF SPECIFIC CLASSES 
OF SOYBEAN SEED PROTEIN GENES 
FIELD OF THE INVENTION 
5 This invention concerns the construction of transgenic soybean lines 

wherein the expression of genes encoding seed storage proteins is modified to 
effect a change in seed storage protein profile of transgenic plants. Such modified 
transgenic soybean lines are used for the production of novel soy protein products 
with unique and valuable functional characteristics. 
10 BACKGROUND OF THE INVENTION 

Soybean seeds contain from 35% to 55% protein on a dry weight basis. 
The majority of this protein is storage protein, which is hydrolyzed during 
germination to provide energy and metabolic intermediates needed by the 
developing seedling. The soybean seed's storage protein is an important 
1 5 nutritional source when harvested and utilized as a livestock feed. In addition, it 
is now generally recognized that soybeans are the most economical source of 
protein for human consumption. Soy protein or protein isolates are already used 
extensively for food products in different parts of the world. Much effort has been 
devoted to improving the quantity and quality of the storage protein in soybean 
20 seeds. 

The seeds of most plant species contain what are known in the art as seed 
storage proteins. These have been classified on the basis of their size and 
solubility (Higgins, T. J. (1984) Ann. Rev. Plant Physiol. J5:191-221). While not 
every class is found in every species, the seeds of most plant species contain 

25 proteins from more than one class. Proteins within a particular solubility or size 
class are generally more structurally related to members of the same class in other 
species than to members of a different class within the same species. In many 
species, the seed proteins of a given class are often encoded by multigene families, 
sometimes of such complexity that the families can be divided into subclasses 

30 based on sequence homology. 

There are two major soybean seed storage proteins:glycinin (also known as 
the 1 IS globulins) and p-conglycinin (also known as the 7S globulins). Together, 
they comprise 70 to 80% of the seed's total protein, or 25 to 35% of the seed's dry 
weight. Glycinin is a large protein with a molecular weight of about 360 kDa. It 

35 is a hexamer composed of the various combinations of five major isoforms 

(commonly called subunits) identified as Gl, G2, G3, G4 and G5. Each subunit is 
in turn composed of one acidic and one basic polypeptide held together by a 
disulfide bond. Both the acidic and basic polypeptides of a single subunit are 
coded for by a single gene. Hence, there are five non-allelic genes that code for 

1 
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the five glycinin subunits. These genes are designated Gy 1 . Gy2, Gy3, Gy4 and 
Gy5, corresponding to subunits Gl, G2, G3, G4 and G5, respectively (Nielsen, N. 
C. et al. (1989) Plant Cell 7:313-328). 

Genomic clones and cDNA's for glycinin subunit genes have been 
5 sequenced and fall into two groups based on nucleotide and amino acid sequence 
similarity. Group I consists of Gyl, Gy2, and Gy3, whereas Group II consists of 
Gy4 and Gy5. There is greater than 85% similarity between genes within a group 
(i.e., at least 85% of the nucleotides of Gyl, Gy2 and Gy3 are identical, and at 
least 85% of the nucleotides of Gy4 and Gy5 are identical), but only 42% to 46% 
1 0 similarity between the genes of Group I and Group II. 

P-Conglycinin (a 7S globulin) is a heterogeneous glycoprotein with a 
molecular weight ranging from 1 50 and 240 kDa. It is composed of varying 
combinations of three highly negatively charged subunits identified as a, a' and p. 
cDNA clones representing the coding regions of the genes encoding the the a and 
1 5 a' subunits have been sequenced and are of similar size; sequence identity is 

limited to 85%. The sequence of the cDNA representing the coding region of the 
P subunit, however, is nearly 0.5 kb smaller than the a and a' cDNAs. Excluding 
this deletion, sequence identity to the a and a' subunits is 75-80%. The three 
classes of p-conglycinin subunits are encoded by a total of 1 5 subunit genes 
20 clustered in several regions within the genome soybean (Harada, J. J. et al. (1 989) 
Plant Ceil 7:415-425). 

New soy based products such as protein concentrates, isolates, and 
textured protein products are increasingly utilized in countries that do not 
necessarily accept traditional oriental soy based foods. Use of these new products 
25 in food applications, however, depends on local tastes and functional 

characteristic of the protein products relative to recipe requirements. Over the 
past 10 years, significant effort has been aimed at understanding the functional 
characteristics of soybean proteins. Examples of functional characteristics include 
water sorption parameters, wettability, swelling, water holding, solubility, 
30 thickening, viscosity, coagulation, gelation characteristics and emulsification 
properties. A large portion of this body of research has focused on study of the 
P-conglycinin and glycinin proteins individually, as well as how each of these 
proteins influences the soy protein system as a whole (Kinsella, J. E. et al. (1985) 
New Protein Foods 5:107-179; Morr, C. V. (1987) JAOCS 67:265-271; Peng, L. 
35 C. et al. (1984) Cereal Chem 67:480-489). Because functional properties are 

directly related to physiochemical properties of proteins, the structural differences 
of p-conglycinin and glycinin result in these two proteins having significantly 
different functional characteristics. Differences in thermal aggregation, 
emulsifying properties, and water holding capacity have been reported. In 
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addition, gelling properties vary as well, with glycinin forming gels that have 
greater tensile strain, stress, and shear strength, better solvent holding capacity, 
and lower turbidity. However, soy protein products produced today are a biend of 
both glycinin and P-conglycinin and therefore have functional characteristics 
5 dependent on the blend of glycinin' s and P-conglycinin's individual 

characteristics. For example, when glycinin is heated to 100°C, about 50% of the 
protein is rapidly converted into soluble aggregates. Further heating results in the 
enlargement of the aggregates and in their precipitation. The precipitate consists 
of the glycines basic polypeptides; the acidic polypeptides remain soluble. The 

1 0 presence of p-conglycinin inhibits the precipitation of the basic polypeptides by 
forming soluble complexes with them. Whether heat denaturation is desireable or 
not depends on the intended use. If one could produce soy protein products 
containing just one or the other storage protein, products requiring specific 
physical characteristics derived from particular soy proteins would become 

1 5 available or would be more economical to produce. 

Over the past 20 years, soybean lines lacking one or more of the various 
storage protein subunits (null mutations) have been identified in the soybean 
germplasm or produced using mutational breeding techniques. Breeding efforts to 
combine mutational events have resulted in soybean lines whose seeds contain 

20 about half the normal amount of p-conglycinin (Takashashi, K. et al. (1994) 

Breeding Science 44:65-66; Kitamura, J. (1995) JARQ 2P:l-8). The reduction of 
P-conglycinin is controlled by three independent recessive mutations. 
Recombining glycinin subunit null mutations have resulted in lines whose seeds 
have significantly reduced amounts of glycinin (Kitamura, J. (1995) JARQ 

25 29: 1 -8). Again, reduction is controlled by three independent recessive mutations. 
Developing agronomically viable soybean varieties from the above lines, in which 
the seed contains only glycinin or p-conglycinin, will be time consuming and 
costly. Each cross will result in the independent segregation of the three 
mutational events. In addition, each mutational event will need to be in the 

30 homozygous state. Development of high yielding agronomically superior soybean 
lines will require the screening and analysis of a large number of progeny over 
numerous generations. 

Antisense technology has been used to reduce specific storage proteins in 
seeds. In Brassica napus, napin (a 2S albumin) and cruciferin (an 1 IS globulin) 

3 5 are the two major storage proteins, comprising about 25% and 60% of the total 
seeds protein, respectively. Napin proteins are coded for by a large multi-gene 
family of up to 16 genes; several cDNA and genomic clones have been sequenced 
(Josefsson, L.-G. et al. (1987) J. Biol Chem 252:12196-12201; Schofield, S. and 
Crouch, M. L. (1987)7. Biol Chem. 262:12202-12208). The genes exhibit 
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greater than 90% sequence identity in both their coding and Hanking regions. The 
cruciferin gene family is equally complex, comprising 3 subfamilies with a total 
of 8 genes (Rodin. J. et al. ( 1 992) Plant Mol. Biol. 20:559-563). Kohno-Murase et 
al. (( 1 994) Plant Mol. Biol. 26:1 1 15-1 124) demonstrated that a napin antisense 
5 gene using the napA gene driven by the napA promoter could be used to construct 
transgenic plants whose seeds contained little or no napin. 

The same group (Kohno-Murase et al. (1995) Theoret. Applied Genetics 
91:621-631) attempted to reduce cruciferin (1 IS globulin) expression in Brassica 
napus by expressing an antisense form of a cruciferin gene (cruA, encoding an 
10 alpha 2/3 isoform) under the control of the napA promoter. In this case the results 
were more complex. The cruciferins are divided into three subclasses based on 
sequence identity (alpha 1, 2/3, and 4); the classes each have from 60-75% 
sequence identity with each other (Rodin, J. et al. (1992) Plant Mol. Biol. 
20:559-563). Expression of the antisense gene encoding the alpha 2/3 isoform 
15 resulted in lower levels of the alpha 1 and 2/3 forms. However, there was no 
reduction in the expression of the alpha 4 class. 

Antisense technology was used to reduce the level of the seed storage 
protein, glutelin, in rice. Expression of the seed specific glutelin promoter 
operably linked to the full length antisense glutelin coding region resulted in about 
20 a 25% reduction in glutelin protein levels (U.S. Patent No. 5,516,668). 

SUMMARY OF THE INVENTION 
The instant invention provides a method for reducing the quantity glycinin 
or p-conglycinin (1 IS or 7S globulins, respectively) seed storage proteins in 
soybeans. In one embodiment, cosuppression technology was used to suppress 
25 the expression of genes encoding the 7S-giobulin class of seed protein genes. 
Genes encoding either two (a and a') or all three subclasses (a, a' and P) of 7S 
globulins were suppressed by expression of the gene encoding a single subclass 
(a) of P-conglycinin, resulting in soybean lines with altered seed storage profiles. 
In another embodiment, a method for supressing two completely different genes, 
30 only one of which is a seed protein gene, is presented, allowing for multiple 
changes in seed composition. Surprisingly, expression of a chimeric gene 
comprising the promoter region of a soybean seed storage protein operably linked 
to the coding region of a soybean gene whose expression alters the fatty acid 
profile of transgenic soybean seeds resulted in simultaneous alteration of two 
35 distinct phenotypic traits: seed storage protein profile and seed oil profile. 

The method for reducing the quantity of soybean seed storage protein 
taught herein comprises the following steps: 

(a) constructing a chimeric gene comprising (i) a nucleic acid 
fragment encoding a promoter that is functional in the cells of 

4 
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soybean seeds, (ii) a nucleic acid fragment encoding all or a 
portion of a soybean seed storage protein placed in sense or 
antisense orientation relative to the promoter of (i), and (iii) a 
transcriptional termination region; 
5 (b) creating a transgenic soybean cell by introducing into a soybean 

cell the chimeric gene of (a); and 
(c) growing the transgenic soybean cells of step (b) under conditions 
that result in expression of the chimeric gene of step (a) 
wherein the quantity of one or more members of a class of soybean seed storage 
10 protein subunits is reduced when compared to soybeans not containing the 
chimeric gene of step (a), 

DETAILED DESCRIPTION OF THE INVENTION 
BRIEF DESCRIPTION OF THE SEQUENCE DESCRIPTIONS 
The invention can be more fully understood from the following detailed 
1 5 description and the Sequence Descriptions which form a part of this application. 
The Sequence Descriptions contain the three letter codes for amino acids as 
defined in 37 C.F.R. 1 .822 which are incorporated herein by reference. 

SEQ ID NO: 1 shows the 5' to 3' nucleotide sequence encoding the a 
subunit of the P-conglycinin soybean seed storage protein. 
20 SEQ ID NO:2 shows the 5' to 3* nucleotide sequence encoding the a* 

subunit of the P-conglycinin soybean seed storage protein. 

SEQ ID NO:3 shows the 5' to 3' nucleotide sequence encoding the p 
subunit of the p-conglycinin soybean seed storage protein. 

SEQ ID NOS:4 and 5 show the nucleotide sequences of the PCR primers 
25 ConS and Con 1 .4a (respectively) used to isolate nucleic acid fragments encoding 
the a and a' subunits of the P-conglycinin soybean seed storage protein. 

SEQ ID NOS:6 and 7 show nucleotide sequences of the PCR primers 
Con.09 and Con. 8 (respectively) used to distinguish nucleic acid fragments 
encoding the a and a' subunits of the p-conglycinin soybean seed storage protein. 
30 SEQ ID NOS:8 and 9 show the nucleotide sequences of the PCR primers 

ConSa and Conl .9a (respectively) used to isolate full length cDNAs encoding the 
a and a' subunits of the p-conglycinin soybean seed storage protein. 

SEQ ID NO: 10 shows the nucleotide sequence of the PCR primer Con. 1.0 
used to confirm the full length cDNA encoding the a and a' subunits of the 
35 p-conglycinin soybean seed storage protein. 

SEQ ID NOS:l 1, 12 and 13 show the 5' to 3' nucleotide sequences 
encoding the Gy 1 , Gy2 and Gy3 subunits (respectively) of the group I glycinin 
soybean seed storage protein. 



5 
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SEQ ID NOS:14 and 15 show the 5' to 3' nucleotide sequences encoding 
the Gy4 and Gy5 subunits (respectively) of the group II glycinin soybean seed 
storage protein. 

SEQ ID NOS: 16, 17 and 18 show the nucleotide sequences of the PCR 
5 primers Gl-1, G 1-1039 and G 1-1475 (respectively) used to isolate the cDNAs 
encoding the subunits of the group I glycinin soybean seed storage protein. 

SEQ ID NOS:19, 20 and 21 show the nucleotide sequences of the PCR 
primers G4-7, G4- 1251, and G4-1670 (respectively) used to isolate the cDNA 
encoding the subunits of the group II glycinin soybean seed storage protein. 
10 BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a restriction map of plasmid pML70, used as an intermediate 
cloning vehicle in construction of chimeric genes of the instant invention. 

Figure 2 is a restriction map of plasmid pCW109, used as an intermediate 
cloning vehicle in construction of chimeric genes of the instant invention. 
1 5 Figure 3 is a restriction map of plasmid pKS 1 8HH, used as an intermediate 

cloning vehicle in construction of chimeric genes of the instant invention. 

Figure 4 is a restriction map of plasmid pJol , This plasmid was derived 
by cloning the plant transcriptional unit KTi promoter/truncated a subunit of 
P-conglycinin/KTi 3' end into the BamH I site of pKS18HH. 
20 Figure 5 is an SDS-PAGE gel of extracted protein from somatic embryos 

transformed with pJol. 

Figure 6 is a restriction map of plasmid pBS43. This plasmid comprises a 
nucleic acid sequence encoding the Glycine max microsomal delta- 12 desaturase 
under the transcriptional control of the soybean P-conglycinin promoter. 
25 Figure 7 is an SDS-PAGE gel of extracted protein from soybean seeds 

obtained from plants transformed with pBS43. 

Figure 8 is a restriction map of plasmid pJo3. This plasmid was derived 
by cloning the plant transcriptional unit KTi promoter/full length cDNA of the a 
subunit of P-conglycinin/KTi 3' end into the Hindlll site of pKS18HH. 
30 Figure 9 is a restriction map of plasmid pRB20. This plasmid was derived 

by cloning the transcriptional unit p-conglycinin promoter/Phaseolin 3' end into 
the Hindlll site of pKSl8HH. It is used as an intermediate cloning vehicle in 
construction of chimeric genes of the instant invention. 

BIOLOGICAL DEPOSITS 
35 The following plasmids have been deposited under the terms of the 

Budapest Treaty at American Type Culture Collection (ATCC), 12301 Parklawn 
Drive, Rockville, MD 20852, and bear the following accession numbers: 
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Plasmid Accession Number Date of Deposit 

p Jo 1 ATCC 976 1 4 June 15, 1996 

pBS43 ATCC 976 1 9 June 1 9, 1 996 

P Jo3 ATCC 976 1 5 June 1 5, 1 996 



DEFINITIONS 

In the context of this disclosure, a number of terms shall be used. The 
term "nucleic acid" refers to a large molecule which can be single-stranded or 
5 double-stranded, composed of monomers (nucleotides) containing a sugar, a 
phosphate and either a purine or pyrimidine. A "nucleic acid fragment" is a 
fraction of a given nucleic acid molecule. In higher plants, deoxyribonucleic acid 
(DNA) is the genetic material while ribonucleic acid (RNA) is involved in the 
transfer of the information in DNA into proteins. A "genome" is the entire body 

10 of genetic material contained in each cell of an organism. The term "nucleotide 
sequence" refers to the sequence of DNA or RNA polymers, which can be single- 
or double-stranded, optionally containing synthetic, non-natural or altered 
nucleotide bases capable of incorporation into DNA or RNA polymers. 

As used herein, the term "homologous to" refers to the relatedness 

1 5 between the nucleotide sequence of two nucleic acid molecules or between the 
amino acid sequences of two protein molecules. Estimates of such homology are 
provided by either DNA-DNA or DNA-RNA hybridization under conditions of 
stringency as is well understood by those skilled in the art (Hames and Higgins, 
Eds. (1985) Nucleic Acid Hybridisation, IRL Press, Oxford, U.K.); or by the 

20 comparison of sequence similarity between two nucleic acids or proteins, such as 
by the method of Needleman et al. ((1970) J. Mol Biol 4?:443-453). 

As used herein, "essentially similar" refers to DNA sequences that may 
involve base changes that do not cause a change in the encoded amino acid, or 
which involve base changes which may alter one or more amino acids, but do not 

25 affect the functional properties of the protein encoded by the DNA sequence. It is 
therefore understood that the invention encompasses more than the specific 
exemplary sequences. Modifications to the sequence, such as deletions, 
insertions, or substitutions in the sequence which produce silent changes that do 
not substantially affect the functional properties of the resulting protein molecule 

30 are also contemplated. For example, alteration in the gene sequence which reflect 
the degeneracy of the genetic code, or which results in the production of a 
chemically equivalent amino acid at a given site, are contemplated; thus, a codon 
for the amino acid alanine, a hydrophobic amino acid, may be substituted by a 
codon encoding another hydrophobic amino acid residue such as glycine, valine, 

35 leucine, or isoleucine. Similarly, changes which result in substitution of one 

7 
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negatively charged residue for another, such as aspartic acid for glutamic acid, or 
one positively charged residue lor another, such as lysine for arginine. can also be 
expected to produce a biologically equivalent product. Nucleotide changes which 
result in alteration of the N-terminal and C-terminal portions of the protein 
5 molecule would also not be expected to alter the activity of the protein. In some 
cases, it may in fact be desirable to make mutants of the sequence in order to study 
the effect of alteration on the biological activity of the protein. Each of the 
proposed modifications is well within the routine skill in the art, as is 
determination of retention of biological activity of the encoded products. 
10 Moreover, the skilled artisan recognizes that "essentially similar" sequences 

encompassed by this invention can also defined by their ability to hybridize, under 
stringent conditions (0.1X SSC, 0.1% SDS, 65°C), with the sequences 
exemplified herein. 

"Gene" refers to a nucleic acid fragment that expresses a specific protein, 
1 5 including regulatory sequences preceding (5' non-coding) and following (3' non- 
coding) the coding region. "Native" gene refers to an isolated gene with its own 
regulatory sequences as found in nature. "Chimeric gene" refers to a gene that 
comprises heterogeneous regulatory and coding sequences not found in nature. 
"Endogenous" gene refers to the native gene normally found in its natural 
20 location in the genome and is not isolated. A "foreign" gene refers to a gene not 
normally found in the host organism but that is introduced by gene transfer. 

"Coding sequence" or "coding region" refers to a DNA sequence that 
codes for a specific protein and excludes the non-coding sequences. It may 
constitute an "uninterrupted coding sequence", i.e., lacking an intron or it may 
25 include one or more introns bounded by appropriate splice junctions. An "intron" 
is a nucleotide sequence that is transcribed in the primary transcript but that is 
removed through cleavage and re-ligation of the RNA within the cell to create the 
mature mRNA that can be translated into a protein. 

"Initiation codon" and "termination codon" refer to a unit of three 
30 adjacent nucleotides in a coding sequence that specifies initiation and chain 
termination, respectively, of protein synthesis (mRNA translation). "Open 
reading frame" refers to the coding sequence uninterrupted by introns between 
initiation and termination codons that encodes an amino acid sequence. 

"RNA transcript" refers to the product resulting from RNA polymerase- 
35 catalyzed transcription of a DNA sequence. When the RNA transcript is a perfect 
complementary copy of the DNA sequence, it is referred to as the primary 
transcript or it may be a RNA sequence derived from posttranscriptional 
processing of the primary transcript and is referred to as the mature RNA. 
"Messenger RNA (mRNA)" refers to the RNA that is without introns and that 



WO 97/4773 1 PCT/US97/09743 

can be translated into protein by the cell. "cDNA" refers to a double-stranded 
DNA that is complementary to and derived from mRNA. "Sense" RNA refers to 
RNA transcript that includes the mRNA. " Antisense RNA" refers to a RNA 
transcript that is complementary to all or part of a target primary transcript or 
5 mRNA and that blocks the expression of a target gene. The complementarity of 
an antisense RNA may be with any part of the specific gene transcript, i.e., at the 
5' non-coding sequence, 3' non-coding sequence, introns, or the coding sequence. 

As used herein, "suitable regulatory sequences" refer to nucleotide 
sequences in native or chimeric genes that are located upstream (5'), within, or 

10 downstream (3') to the nucleic acid fragments of the invention, which control the 
expression of the nucleic acid fragments of the invention. The term "expression", 
as used herein, refers to the transcription and stable accumulation of the sense 
(mRNA) or the antisense RNA derived from the nucleic acid fragment(s) of the 
invention that, in conjunction with the protein apparatus of the cell, results in 

1 5 altered phenotypic traits. Expression of the gene involves transcription of the 
gene and translation of the mRNA into precursor or mature proteins. "Antisense 
inhibition" refers to the production of antisense RNA transcripts capable of 
preventing the expression of the target protein. "Overexpression" refers to the 
production of a gene product in transgenic organisms that exceeds levels of 

20 production in normal or non-transformed organisms. "Cosuppression" refers to 
the expression of a foreign gene which has substantial homology to an 
endogenous gene resulting in the suppression of expression of both the foreign 
and the endogenous gene. "Altered levels" refers to the production of gene 
product(s) in transgenic organisms in amounts or proportions that differ from that 

25 of normal or non-transformed organisms. The skilled artisan will recognize that 
the phenotypic effects contemplated by this invention can be achieved by 
alteration of the level of gene product(s) produced in transgenic organisms relative 
to normal or non-transformed organisms, namely a reduction in gene expression 
mediated by antisense suppression or cosuppression. 

30 "Promoter" refers to a DNA sequence in a gene, usually upstream (5') to 

its coding sequence, which controls the expression of the coding sequence by 
providing the recognition for RNA polymerase and other factors required for 
proper transcription. In artificial DNA constructs, promoters can also be used to 
transcribe antisense RNA. Promoters may also contain DNA sequences that are 

35 involved in the binding of protein factors which control the effectiveness of 

transcription initiation in response to physiological or developmental conditions. 
It may also contain enhancer elements. An "enhancer" is a DNA sequence which 
can stimulate promoter activity. It may be an innate element of the promoter or a 
heterologous element inserted to enhance the level or tissue-specificity of a 

9 
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promoter. "Constitutive promoters" refers to those that direct gene expression in 
all tissues and at all times. "Tissue-specific" or "development-specific" 
promoters as referred to herein are those that direct gene expression almost 
exclusively in specific tissues, such as leaves or seeds, or at specific development 
5 stages in a tissue, such as in early or late embryogenesis, respectively. 

The "3' non-coding sequences" refers to the DNA sequence portion of a 
gene that contains a polyadenylation signal and any other regulatory signal 
capable of affecting mRNA processing or gene expression. The polyadenylation 
signal is usually characterized by affecting the addition of polyadenylic acid tracts 
10 to the 3' end of the mRNA precursor. 

The term "operably linked" refers to nucleic acid sequences on a single 
nucleic acid molecule which are associated so that the function of one is affected 
by the other. For example, a promoter is operably linked with a structural gene 
when it is capable of affecting the expression of that structural gene (i.e., that the 
15 structural gene is under the transcriptional control of the promoter). 

"Transformation" refers to the transfer of a nucleic acid fragment into the 
genome of a host organism, resulting in genetically stable inheritence. Host 
organisms containing the transformed nucleic acid fragments are referred to as 
"transgenic" organisms. 
20 This invention concerns the construction of transgenic soybean lines 

wherein the expression of genes encoding seed storage proteins are modulated to 
effect a change in seed storage protein profile of transgenic plants. Modification 
of the seed storage protein profile can result in production of novel soy protein 
products with unique and valuable functional characteristics. 
25 Gene expression in plants uses regulatory sequences that are functional in 

such plants. The expression of foreign genes in plants is well-established 
(De Blaere et al. (1987) Meth Enzymol 755:277-291). The source of the 
promoter chosen to drive the expression of the fragments of the invention is not 
critical provided it has sufficient transcriptional activity to accomplish the 
30 invention by decreasing the expression of the target seed storage protein genes. 
Preferred promoters include strong constitutive plant promoters, such as those 
directing the I9S and 35S transcripts in cauliflower mosaic virus (Odell, J. T. et 
al, (1985) Nature 573:810-812; Hull et al. (1987) Virology Sd: 4 82-493). 
Particularly preferred promoters are those that allow seed-specific expression. 
35 Examples of seed-specific promoters include, but are not limited to, the promoters 
of seed storage proteins, which can represent up to 90% of total seed protein in 
many plants, The seed storage proteins are strictly regulated, being expressed 
almost exclusively in seeds in a highly tissue-specific and stage-specific manner 
(Higgins et al. (1984) Ann. Rev. Plant Physiol. JJ:191-221 ; Goldberg et al. (1989) 
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Cell id: 149-160). Moreover, different seed storage proteins may be expressed at 
different stages of seed development. 

Expression of seed-specific genes has been studied in great detail (See 
reviews by Goldberg et aL (1989) Cell 5*5:149-160 and Higgins et al. {\9U)Ann. 
5 Rev. Plant PhysioL 55:191-221). There are currently numerous examples of seed- 
specific expression of seed storage protein genes (natural or chimeric) in 
transgenic dicotyledonous plants; in general, temporal and spatial expression 
patterns are maintained. The promoters used in such examples could potentially 
be used to affect the present invention. These include genes from dicotyledonous 

1 0 plants for bean P-phaseolin (Sengupta-Gopalan et al.(I985) Proc. Natl Acad. ScL 
USA 52:3320-3324; Hoffman et al. (1988) Plant Moi Biol 7/:717-729), bean 
lectin (Voelker et al. (1987) EMBO 1 6:3571-3577), soybean lectin (Okamuro et 
al. (1986) Proc. Natl Acad. ScL USA 53:8240-8244), soybean Kunitz trypsin 
inhibitor (Perez-Grau et al. (1989) Plant Cell 7:095-1 109), soybean p-conglycinin 

15 (Beachy et al. (1985) EMBO J. 4:3047-3053; pea vicilin (Higgins et al. (1988) 
Plant Mol Biol 77:683-695), pea convicilin (Newbigin et al. (1990) Planta 
750:461-470), pea legumin (Shirsat et al. (1989) Mol Gen. Genetics 
275:326-331), rapeseed napin (Radke et al. (1988) Theor. Appl Genet. 
75:685-694) and Arabidopsis thaliana 2S albumin (Vandekerckhove et al. (1989) 

20 Bio/Technology 7:929-932). 

Of particular use in the expression of the nucleic acid fragment of the 
invention will be the heterologous promoters from several soybean seed storage 
protein genes such as those for the Kunitz trypsin inhibitor (KTi; Jofuku et al. 
(1989) Plant Cell 7:1079-1093; glycinin (Nielson et al. (1989) Plant Cell 

25 7:313-328), and P-conglycinin (Harada et al. (1989) Plant Cell 7:415-425). The 
skilled artisan will recognize that attention must be paid to differences in temporal 
regulation endowed by different seed promoters. For example, the promoter for 
the ot-subunit gene is expressed a few days before that for the p-subunit gene 
(Beachy et al. (1985) EMBO J. 4:3047-3053), so that the use of the P-subunit gene 

30 is likely to be less useful for suppressing a-subunit expression. 

Also of potential use, but less preferred, will be the promoters of genes 
involved in other aspects of seed metabolism, such as lipid or carbohydrate 
biosynthesis. In summary, the skilled artisan will have no difficulty in 
recognizing that any promoter of sufficient strength and appropriate temporal 

35 expression pattern can potentially be used to implement the present invention. 
Similarly, the introduction of enhancers or enhancer-like elements into the 
promoter regions of either the native or chimeric nucleic acid fragments of the 
invention would result in increased expression to accomplish the invention. This 
would include viral enhancers such as that found in the 35S promoter (Odell et al. 
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{ 1 988) Plant Mol Biol /0:263-272). enhancers from the opine genes (Fromm et 
al. ( 1 989) Plant Cell /:977-984), or enhancers from any other source that result in 
increased transcription when placed into a promoter operably linked to the nucleic 
acid fragment of the invention. 

Of particular importance is the DNA sequence element isolated from the 
gene encoding the oc-subunit of p-conglycinin that can confer a 40-fold, seed- 
specific enhancement to a constitutive promoter (Chen et al. (1 989) Dev. Genet 
/0: 1 1 2- 1 22). One skilled in the art can readily isolate this element and insert it 
within the promoter region of any gene in order to obtain seed-specific enhanced 
expression with the promoter in transgenic plants. Insertion of such an element in 
any seed-specific gene that is normally expressed at times different than the 
P-conglycinin gene will result in expression of that gene in transgenic plants for a 
longer period during seed development. 

Any 3' non-coding region capable of providing a polyadenylation signal 
and other regulatory sequences that may be required for the proper expression of 
the nucleic acid fragments of the invention can be used to accomplish the 
invention. This would include 3' ends of the native fatty acid desaturase(s), viral 
genes such as from the 35S or the 19S cauliflower mosaic virus transcripts, from 
the opine synthesis genes, ribulose 1 ,5-bisphosphate carboxylase, or chlorophyll 
a/b binding protein. There are numerous examples in the art that teach the 
usefulness of different 3' non-coding regions. 

Various methods of transforming cells of higher plants according to the 
present invention are available to those skilled in the art (see European Patent 
Publications EP-A-295,959 and EP-A-3 1 8,341). Such methods include those 
based on transformation vectors utilizing the Ti and Ri plasmids of 
Agrobacterium spp. It is particularly preferred to use the binary type of these 
vectors. Ti-derived vectors transform a wide variety of higher plants, including 
monocotyledonous and dicotyledonous plants (Sukhapinda et al. (1987) Plant 
Mol Biol &209-216; Potrykus, (1985) Mol Gen. Genet. I99AS3). Other 
transformation methods are available to those skilled in the art, such as direct 
uptake of foreign DNA constructs (see European Patent Publication 
EP-A-295,959), techniques of electroporation (Fromm et al. (1986) Nature 
(London) 3/9:791) or high-velocity ballistic bombardment with metal particles 
coated with the nucleic acid constructs (Klein et al. (1987) Nature (London) 
327:70). Once transformed, the cells can be regenerated by those skilled in the 
art. Of particular relevance are the recently described methods to transform 
soybean, including McCabe et al. ((1988) Bio/Technology (5:923-926), Finer et al. 
((1991) In Vitro Cell Dev. Biol 27:175-182) and Hinchee, M.A.W. ((1988) 
Bio/Technology 6:9\ 5-922). 

12 
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Once transgenic plants are obtained by one of the methods described 

above, it is necessary to screen individual transgenics for those that most 
effectively display the desired phenotype. It is well known to those skilled in the 
art that individual transgenic plants carrying the same construct may differ in 
5 expression levels; this phenomenon is commonly referred to as "position effect". 
Thus, in the present invention different individual transformants may vary in the 
effectiveness of suppression of the target seed protein. The person skilled in the 
art will know that special considerations are associated with the use of antisense or 
cosuppresion technologies in order to reduce expression of particular genes. U.S. 

10 Patent Nos. 5,190,931, 5,107,065 and 5,283,323 have taught the feasibility of 
these techniques, but it is well known that their efficiency is unpredictable. 
Accordingly, the person skilled in the art will make multiple genetic constructs 
containing one or more different parts of the gene to be suppressed, since the art 
does not teach a method to predict which will be most effective for a particular 

1 5 gene. Furthermore, even the most effective constructs will give an effective 
suppression phenotype only in a fraction of the individual transgenic lines 
isolated. For example, World Patent Publications W093/1 1245 and W094/1 1516 
teach that when attempting to suppress the expression of fatty acid desaturase 
genes in canola, actual suppression was obtained in less than 1% of the lines 

20 tested. In other species the percentage is somewhat higher, but in no case does the 
percentage reach 100. This should not be seen as a limitation on the present 
invention, but instead as practical matter that is appreciated and anticipated by the 
person skilled in this art. Accordingly, the skilled artisan will develop methods 
for screening large numbers of transformants. The nature of these screens will 

25 generally be chosen on practical grounds, and is not an inherent part of the 
invention. A preferred method will be one which allows large numbers of 
samples to be processed rapidly, since it will be expected that the majority of 
samples will be negative. 

The mechanism of cosuppression remains unclear (for one review and 

30 speculation, see Flavcll, R. (1994) Proc. Natl Acad Sci USA 97:3490-3496), and 
therefore the exact requirments to induce it when desired are also unclear. Most 
examples found in the literature involve the use of all or a large part of the 
transcribed region of the gene to be cosuppressed to elicit the desired response. 
However, in at least one case (Brusslan et al. (1993) Plant Cell 5:667-671; 

35 Brusslan and Tobin (1995) Plant Mol. Biol 27:809-813), that of the cabl40 gene 
of Arabidopsis, the use of the promoter (as a 1.3 kb fragment) and just 14 bp of 
transcribed region fused to a completely unrelated gene was sufficient to result in 
cosuppression of the endogenous cab 1 40 gene as well as the introduced chimeric 
gene. This result is unusual and apparantly quite unpredictable, as numerous other 
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promoter-leader (the 5' untranslated leader being defined as the region between the 
start of transcription and the translation initiation codon) units have been used to 
drive chimeric genes successfully. Flavell speculates that some or many genes 
(including members of multigene families such as those encoding seed proteins) 
5 may have evolved so as to avoid the mechanisms of cosuppression, while others 
have not, providing a potential further level of regulation as genomes evolve. 
Thus, the instant observation that the promoter and leader of the conglycinin gene 
can be used to suppress expression of endogenous conglycins while the other 
portion of the transgene (beyond the initiation codon) can be used to suppress a 
1 0 completely unrelated gene is unique. 

EXAMPLES 

The present invention is further defined by the following examples. It will 
be understood that the examples are given for illustration only and the present 
invention is not limited to uses described in the examples. The present invention 

15 can be used to generate transgenic soybean plants with altered levels of various 
seed storage proteins. From the above discussion and the following examples, one 
skilled in the art can ascertain, and without departing from the spirit and scope 
thereof, can make various changes and modifications of the invention to adapt it to 
various usages and conditions. All such modifications are intended to fall within 

20 the scope of the intended claims. 

Detailed procedures for DNA manipulation, such as use of restriction 
endonuclease enzymes, other modifying enzymes, agarose gel electrophoresis, 
nucleic acid hybridization, and transformation of E. colt with plasmid DNA are 
described in Sambrook et al. (1989) Molecular Cloning, A Laboratory manual, 

25 2nd ed, Cold Spring Harbor Laboratory Press (hereinafter "Maniatis"). All 

restriction enzymes and other modifying enzymes were obtained from Gibco BRL 
(Gaithersburg, MD). 

EXAMPLE 1 

To determine whether the expression of p-conglycinin in developing 
30 soybean cotyledons could be the target of cosuppression, truncated cDNA 

fragments of the a and a' subunits of p-conglycinin were prepared using a reverse 
transcriptase polymerase chain reaction kit (Geneamp™ RNA PCR Kit; Perkin 
Elmer Cetus). The upper primer, ConS, is homologous to nucleotides 5-19 of the 
a and a' subunit cDNA sequences obtained from the EMBL/GenBank/DDBJ 
35 databases. To aid cloning, additional nucleotides were added to the 5* end to code 
for an Nco I restriction site. The lower primer, Con 1 .4a, is complementary to 
nucleotides 1370-1354 ofSEQIDNO:l and 1472-1456 of SEQ ID NO:2, 
representing the sequences of the a and a' cDNAs, respectively. To aid in 
cloning, additional nucleotides were added to the 5' end to introduce a Kpn I 

14 
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The nucleotide sequences of PCR primers ConS and Con 1 ,4a are 



ConS S'-CGTACCATGGTGAGAGCGCGGTTCCO' (SEQ ID NO:4) 
Nco I 

Conl.4a 5 '-CGGTACCGAATTGAAGTGTGGTAG-3 * (SEQIDNO:5) 
Kpn I 

RNA isolated from developing soybean seeds was reverse-transcribed 
using either the kit-supplied random hexamers, or Con 1.4a, following the 
manufacturer's protocol. The resulting cDNA fragments were amplified in the 
PCR (Polymerase Chain Reaction) reaction using a mixture of ConS and Conl .4a. 
Reactant concentrations were as described in the manufacturer's protocols. The 
following program was used: a) one cycle of 2 minutes at 95°C; b) 35 cycles of: 
1 .5 minutes at 50°C (annealing), 5 minutes at 70°C (extension), 1 .5 minutes at 
95°C (denaturation); and c) one cycle of 2 minutes at 50°C followed by 
10 minutes at 68 °C. Fifteen microliters of each of the PCR reaction mixes was 
analyzed by agarose gel electrophoresis. Reactions resulted in PCR products of 
the expected sizes: 1 .47 kb for a' and 1 .37 kb for a. The truncated cDNA 
fragments from the remainder of the reaction mixes were purified using the 
Wizard™ PCR Preps DNA Purification System kit (Promega), 

The purified reaction mix containing the a and a' fragments, which 
because of the primers used, included Nco I restriction sites at the 5* ends and 
Kpn I restriction sites at the 3' ends, were digested with Kpn I and Nco I 
restriction enzymes. The a cDNA fragment was recovered following gel 
electrophoresis, designated as fragment F8, and directionally cloned (sense 
orientation) into pCW109 (Fig. 1) and pML70 (Fig. 2) using the Nco I to Kpn I 
sites present in both plasmids. F8 was confirmed as a by PCR using a nested set 
of primers (Con.09 and Con.8) internal to ConS and Conl. 4a, and distinguished 
from a' by digestion of pCW!09/F8 plasmid with Hind III, Nco I, Kpn I, and Pst I 
(a does not contain a Pst I site whereas a' does). 

Con.09 5'-TCGTCCATGGAGCGCGGTTCCCATTAC-3* (SEQ IDNO:6) 

Con.8 5'-TCTCGGTCGTCGTTGTT-3 ' (SEQ ID NO:7) 

The transcriptional unit KTi promoter/truncated ot/KTi 3 r end was released 
from plasmid pML70/F8 by restriction digest with BamHI, gel isolated, and 
labeled as Fl 1. Fl 1 was then cloned into pKS18HH (Fig. 3) at the BamH I site. 
pKS18HH is a plasmid construction containing the following genetic elements: 
(i) T7 promoter/Hygromycin B Phosphotransferase (HPT)/T7 Terminator 
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Sequence; (ii) 35S promoter from cauliflower mosaic virus (CaMV)/Hygromycin 
B Phosphotransferase (HPT)ZNopaline Synthase (NOS) 3' from Agrobacterium 
tumefaciens T-DNA; and (iii) pSP72 plasmid vector (Promega) with beta- 
iactamase coding region removed. One skilled in the art of molecular biology can 
5 ligate the above three components into a single plasmid vector using well known 
protocols (Maniatis). 

The Hygromycin B Phosphotransferase (HPT) gene was isolated by PCR 
amplification from E. coli strain W677 containing a Klebsiella-derived plasmid 
pJR225 (Gritz L., and Davies J. ( 1 983) Gene 25: 1 79- 1 88). pKS 1 8HH contains 

10 the CaMV 35S/HPT/NOS cassette for constitutive expression of the HPT enzyme 
in plants, such as soybean. The pKS 1 8HH plasmid also contains the T7 
promoter/HPT/T7 terminator cassette for expression of the HPT enzyme in certain 
strains of £. coli, such as NovaBIue™ (DE3) (Novagen) that are lysogenic for 
lambda DE3 (which carries the T7 RNA Polymerase gene under lacUVS control). 

1 5 pKS 1 8HH also contains three unique restriction endonuclease sites suitable for 
cloning of genes into this vector. Thus, the pKS18HH plasmid vector allows the 
use of Hygromycin B for selection in both E. coli and plants. Confirmation of 
insertion and orientation of the Fl 1 fragment was accomplished by digestion with 
Hindlll. A clone with the Fl 1 fragment in clockwise orientation was selected and 

20 labeled pJol (Fig. 4). 

Transformation of Somatic Embryo Cultures 
The following stock solutions and media were used for transformation and 
propogation of soybean somatic embryos: 



Stock Solutions 
MS Sulfate IQOx stock 
MgS0 4 7H 2 0 
MnS0 4 H 2 0 
ZnS0 4 7H 2 0 
CuS0 4 5H 2 0 

MS Halides IQOx stock 



CaCl 2 
KI 

CoCl 2 

KH 2 P0 4 

H3BO3 

Na 2 Mo0 4 



2H 2 0 



6H 2 0 



2H 2 0 



Media 

(g/L) SB55 foer Liter) 

37.0 1 0 mL of each MS stock 

1 .69 1 mL of B5 Vitamin stock 

0.86 0.8 g NH 4 N0 3 

0.0025 3.033 g KN0 3 

1 mL2,4-D(10mg/mL 
stock) 

44.0 0.667 g asparagine 

0.083 pH 5.7 

0.00125 

17.0 SB 103 (per Liter) 

0.62 I pk. Murashige & Skoog salt 

mixture (Gibco BRL) 

0.025 60 g maltose 
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Na 2 EDTA 
FeS0 4 
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7FhO 



3.724 
2.784 



B5 Vitamin stock 
myoinositol 

nicotinic acid 
pyridoxine HC1 
thiamine 



100.0 

1.0 
1.0 
10.0 



2 g gelrite 
pH5.7 

(For SB 103 plus charcoal, 
add 5 g charcoal) 

SB 148 (per Liter) 

1 pk. Murashige & Skoog 
salt mixture (Gibco BRL) 

60 g maltose 

1 mL B5 vitamin stock 

7 g agarose 
pH5.7 



Soybean embryonic suspension cultures were maintained in 35 mL liquid 
media (SB55) on a rotary shaker (150 rpm) at 28°C with a mix of fluorescent and 
incandescent lights providing a 16/8 h day/night schedule. Cultures were 
5 subcultured every 2 to 3 weeks by inoculating approximately 35 mg of tissue into 
35 mL of liquid media. 

Soybean embryonic suspension cultures were transformed with pJol by 
the method of particle gun bombardment (see Klein et al. (1987) Nature 327:70). 
A DuPont Biolistic™ PDSIOOO/He instrument was used for these transformations. 

10 Five \\L of pJol plasmid DNA (1 ug/ul), 50 uL CaCl 2 (2.5 M), and 20 uL 

spermidine (0.1 M) were added to 50 uL of a 60 mg/mL 1 mm gold particle 
suspension. The particle preparation was agitated for 3 minutes, spun in a 
microfuge for 10 seconds and the supernatant removed. The DNA-coated 
particles were then washed once with 400 uL 70% ethanol and resuspended in 

15 40 |iL of anhydrous ethanol. The DN A/particie suspension was sonicated three 
times for 1 second each. Five uL of the DNA-coated gold particles were then 
loaded on each macro carrier disk. 

Approximately 300 to 400 mg of two week old suspension culture was 
placed in an empty 60 mm x 1 5 mm petri dish and the residual liquid removed 

20 from the tissue by pipette. The tissue was placed about 3.5 inches away from the 
retaining screen and bombarded twice. Membrane rupture pressure was set at 
1 000 psi and the chamber was evacuated to -28 inches of Hg. Two plates were 
bombarded per construct per experiment. Following bombardment, the tissue was 
divided in half and placed back into liquid media and cultured as described above. 

25 Fifteen days after bombardment, the liquid media was exchanged with 

fresh SB55 containing 50 mg/mL hygromycin. The selective media was refreshed 
weekly. Six weeks after bombardment, green, transformed tissue was isolated and 
inoculated into flasks to generate new transformed embryonic suspension cultures. 
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Transformed embryonic clusters were removed from liquid culture media 
and placed on a solid agar media, SB 103, plus 0.5% charcoal to begin maturation. 
After 1 week, embryos were transferred to SB 103 media minus charcoal. After 
3 weeks on SB 103 media, maturing embryos were separated and placed onto 
5 SB 148 media. Conditions during embryo maturation were 26° C, with a mix of 
fluorescnt and incandescent lights providing a 16/8 h day/night schedule. After 
6 weeks on SB 1 48 media, embryos were analyzed for the expression of the 
P-conglycinin subunit proteins. Each embryonic cluster gave rise to 5 to 
20 somatic embryos. 

10 Analysis of Transformed Somatic Embryos 

Initial experiments were performed to determine when the a, a' and P 
subunits of p-conglycinin could be visualized during somatic embryo maturation 
by SDS-PAGE gel electrophoresis. Cotyledons of non-transformed embryos 
(generated as above, except they did not undergo bombardment) were dissected 

15 from embryos at 6, 8, 10, and 12 weeks after initiating maturation and kept frozen 
at -80°C until analyzed. Cotyledonary tissue was weighed, 10 uL/mg tissue of 
extraction buffer was added, and the tissue ground in a Pellet Pestle Disposable 
Mixer (Kimble/Kontes). Extraction buffer consisted of 50 mM Tris-HCl (pH 7.5), 
10 mM p-mercaptoethanol (BME), and 0.1% SDS. The samples were then 

20 microfuged at 12,000 rpm for 10 minutes and supernatant remove to a new 
microfuge tube by pipette. Extracts were kept frozen at -20° C until used. 

For SDS-PAGE analysis, 8 of (2x) loading buffer was added to 8 u-L of 
sample extract. The (2x) loading buffer consisted of 100 mM Tris-HCl (pH 7.5), 
4% SDS, 0.2% bromophenol blue, 1 5% glycerol, and 200 mM PME. The mixture 

25 was heated at 95°C for 4 minutes. Sample mixes were then microfuged 
(12,000 rpm for 20 seconds) and loaded onto a 10% precast Ready Gel™ 
(Bio-Rad) that was assembled into a mini-Protein II Electrophoresis Cell 
(Bio-Rad). Bio-Rad Tris/Glycine/SDS Buffer was used as the running buffer and 
voltage was a constant 125V. In addition to sample extracts, each gel contained 

30 one lane with a molecular weight standard (Bio-rad SDS-PAGE standard, low 
range) and one lane with total soybean seed protein extracted from commercial 
defatted soy flour. Upon completion, the gels were stained with Coomassie 
Brilliant Blue and destained (Maniatis) in order to visualize proteins. Gels were 
photographed, placed in a sealed bag with water, and stored in the refrigerator. 

35 Results indicated that the a, a' and P subunits of p-conglycinin were detectable in 
the cotyledons of somatic embryos between 8 and 10 weeks after the start of 
maturation. 

Analysis of transformed embryos was carried out at 10 weeks after the 
start of maturation using the methods described above. Two embryos per clone 
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were analyzed initially. Additional embryos were analyzed if suppression of the 
P-conglycinin subunits was observed in the two embryos. Table 1 presents the 
results of this analysis, wherein the presence or absence of each P-conglycinin 
subunit is indicated by a (+) or (-), respectively. 

5 

TABLE 1 



Clone 


Embryo 


a 


a* 




JoM 


1 






+ 




2 


— 


— 


+ 




3 


+ 


+ 


+ 




4 




- 


+ 




5 


+ 


+ 


+ 


Jol-2 


1 


+ 


+ 


+ 




2 


+ 


+ 


+ 


Jo 1-3 


1 


+ 


+ 


+ 




2 




+ 


+ 


JoM 


1 


- 


- 


- 




2 


- 


- 


- 




3 


— 








4 


+ 




i 




5 


- 






Jol-5b 


1 


4* 


+ 


+ 




2 


+ 


+ 




Jol-5c 


1 


- 


- 


+ 




2 


- 


- 


+ 


Jol-5d 


1 


+ 


+ 


+ 




2 


+ 




i 

■+* 


Jol-6a 


1 


- 


- 


+ 




2 


- 




+ 




3 






T 




4 






+ 




5 


+ 


+ 


+ 


Jol-6b 


1 


+ 


+ 


+ 




2 


+ 


+ 


+ 


Jol-6c 


1 




+ 


+ 


Jol-6d 


1 


+ 


+ 


+ 




2 


+ 


+ 


+ 


Jol-6d 


1 


+ 


+ 


+ 




2 


+ 


+ 


+ 


Jol-6e 


1 


+ 


+ 


+ 




2 


+ 


+ 


+ 


Jol-7a 


1 






+ 




2 


+ 


+ 




Jol-7b 


1 






+ 


Jol-8a 


1 


+ 


+ 


+ 


Jol-8b 


1 


+ 


+ 


+ 
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-r 


+ 


+ 


Jol-va 


1 


+ 


+ 


+ 




-) 
Z 


+ 


+ 


+ 


In 1 -Qh 


i 
1 


+ 


+ 


+ 




2 








Jo I -9c 


I 
t 




I 

T 


+ 


Jol-10 


1 






+ 




2 


+ 


+ 


+ 



Seven transgenic clones gave rise to embryos in which the expression a 
and a* was suppressed. In addition, one clone (Jo I -4) gave rise to embryos in 
which all three P-conglycinin subunits were suppressed. This result is surprising 
as the truncated a transgene sequence overlaps with only a 0.75 kb portion of the 
total 1 .32 kb p subunit cDNA. Overall, there is only 52% similarity between the 
truncated a transgene and the p subunit cDNA. With the knowledge at hand, the 
truncated a transgene would not be considered to possess sufficient similarity of 
stucture to "cosuppress" the p subunit of the P-conglycinin gene. 

An example of an SDS-PAGE analysis is shown in Figure 5. Lanes 1-3 
are extracts of three cotyledons dissected from embryos generated from clone 
Jo 1 -1 . Lanes 4 and 5 are protein molecular weight standards and soy protein 
standard derived from seed, respectively. Lanes 6-8 are extracts of cotyledons 
dissected from embryos generated from clone Jo 1-4. The protein pattern in lane 2 
is an example of embryos in which both a and a' are co-suppressed. The protein 
patterns in lanes 6 and 8 are examples of embryos where all the subunits 
comprising P-conglycinin are suppressed. 

EXAMPLE 2 

To determine if expression of P-conglycinin could be suppressed in 
developing cotyledons by cosuppression using the P-conglycinin promoter region, 
a plasmid, designated pBS43, containing a Glycine max microsomal delta- 12 
desaturase cDNA (GmFad 2-1) sequence (Heppard et al., (1996) Plant Physiol. 
1 10:31 1-319; GenBank Acc. No. L43920) under control of the soybean 
p-conglycinin promoter (Beachy et al., (1985) EMBOJ. 4:3047-3053), was 
constructed. The construction of this vector was facilitated by the use of the 
following plasmids: pMH40, pCST2 and pBS 1 3. The plasmid constructions 
detailed below are described in part in United States Patent Application No. 
USSN 08/262,401 and World Patent Publication No. W094/1 1516, both of which 
are incorporated herein by reference. 

The pMH40 vector was derived from plasmid pGEM9z, a commercially 
available cloning vector (Promega Biotech) by the insertion a 1.4 kb 35S promoter 
region from CaMV (Odell et al. (1985) Nature 30J:81O-812; Harpster et al. (1988) 
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Moi Gen. Genet. 212: 1 82-190) coupled to the p-glucuronidase gene from E. coli. 
This was a 1 .85 kb fragment encoding the enzyme P-glucuronidase (Jefferson et 
al. (1986) PNAS USA £3:8447-8451) and a 0.3 kb DNA fragment containing the 
transcription terminator from the nopaline synthase gene of the Ti-plasmid of 
5 Agrobacterium tumefaciens (Fraley et al. (1983) PNAS USA 50:4803-4807). 

The vector pCST2 was derived from vectors pML18 and pCW109A. The 
plasmid pCW109A contains the soybean p-conglycinin promoter sequence and 
the phaseolin 3* untranslated region and is a modified version of vector pCW109 
which was derived from the commercially available plasmid pUC18 
1 0 (Gibco-BRL). The vector pCWl 09 was made by inserting into the Hind III site of 
the cloning vector pUC 1 8 a 555 bp 5' non-coding region (containing the promoter 
region) of the p-conglycinin gene followed by the multiple cloning sequence 
containing the restriction endonuclease sites for Nco I, Sma I, Kpn I and Xba I, 
then 1 174 bp of the common bean phaseolin 3' untranslated region into the 

1 5 Hind III site. The P-conglycinin promoter region used is an allele of the published 
P-conglycinin gene (Doyle et al., (1986) J. Biol Chem. 257:9228-9238) due to 
differences at 27 nucleotide positions. Further sequence description of this gene 
may be found World Patent Publication W091/13993. 

To facilitate use in antisense constructions, the Nco I site and potential 

20 translation start site in the plasmid pCW109 was destroyed by digestion with 

Nco I, mung bean exonuclease digestion and religation of the blunt site to give the 
modified plasmid pCW109A. 

The vector pML18 consists of the non-tissue specific and constitutive 
cauliflower mosaic virus (35S) promoter (Odell et al., (1985) Nature 375:810-812; 

25 Hull et al., (1 987) Virology 5d:482-493), driving expression of the neomycin 

phosphotransferase gene (Beck et al. (1982) Gene 7P:327-336) followed by the 3' 
end of the nopaline synthase gene including nucleotides 848 to 1550 (Depicker et 
al. (1982) J. Appl Genet. 7:561-574). This transcriptional unit was inserted into 
the commercial cloning vector pGEM9z (Gibco-BRL) and is flanked at the 5' end 

30 of the 35S promoter by the restriction sites Sal I, Xba I, Bam HI and Sma 1, in that 
order. An additional Sal I site is present at the 3' end of the NOS 3' sequence and 
the Xba I, Bam HI and Sal I sites are unique. The plasmid pML18 was digested 
with Xba I, the singled stranded ends were filled-in using the Klenow fragment of 
DNA polymerase I, and the product was ligated in order to remove the Xba I site. 

35 The resulting plasmid was designated pBS 1 6. 

The plasmid pCW109A was digested with Hind III and the resulting 
1.84 kb fragment, which contained the P-conglycinin/antisense delta-12 
desaturase cDNA/phaseolin 3' untranslated region, was gel isolated. This 1.84 kb 
fragment was ligated into the Hind III site of pBS16. A plasmid containing the 
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insert in the desired orientation yielded a 3.53 kb and 4.41 kb fragment when 
digested with Kpn I and this plasmid was designated pCST2. 

The vector pBS 13 was used as the source of the GmFad2-l cDNA, which 
encodes the soybean microsomal delta 1 2-desaturase and possesses the sequence as 
5 disclosed in GenBank Acc. No. L43920. The vector pBS 1 3 was derived from the 
vector pML70 (Fig. 1), which contains the FCTi3 promoter and the KTi3 3' 
untranslated region and was derived from the commercially available vector 
pTZ18R (Pharmacia) via the intermediate plasmids pML51, pML55, pML64 and 
pML65. A 2.4 kb Bst BI/Eco Ri fragment of the complete soybean KTi3 gene 
10 (Jofuku and Goldberg (1989) Plant Cell 7:1079-1093), which contains all 
2039 nucleotides of the 5' untranslated region and 390 bases of the coding 
sequence of the KTi3 gene ending at the Eco RI site corresponding to bases 755 to 
761 of the sequence described in Jofuku (supra), was ligated into the Acc I/Eco RI 
sites of pTZl 8R to create the plasmid pML5 1 . To destroy an Nco I site in the 

1 5 middle of the 5' untranslated region of the KTi3 insert, plasmid pML5 1 was cut 
with Nco I, the singled stranded ends were filled-in using the Klenow fragment of 
DNA polymerase I, and the product was religated resulting in the plasmid pML55. 
The plasmid pML55 was partially digested with Xmn I/Eco RI to release a 
0.42 kb fragment, corresponding to bases 732 to 755 of the above cited sequence, 

20 which was discarded. A synthetic Xmn I/Eco RI linker containing an Nco I site, 
was constructed by making a dimer of complementary synthetic oligonucleotides 
consisting of the coding sequence for an Xmn I site (5 , -TCTTCC-3') and an Nco I 
site (5-CCATGGG-3') followed directly by part of an Eco RI site (5'-GAAGG-3'). 
The Xmn I and Nco I/Eco RI sites were linked by a short intervening sequence 

25 (5-ATAGCCCCCCAA-3'). This synthetic linker was ligated into the 

Xmn I/Eco RI sites of the 4.94 kb fragment to create the plasmid pML64. The 3' 
untranslated region of the KTi3 gene was amplified from the sequence described 
in Jofuku (supra) by standard PCR protocols (Perkin Elmer Cetus, GeneAmp 
PCR kit) using the primers ML51 and ML52. Primer ML51 contained the 

30 20 nucleotides corresponding to bases 1072 to 1091 of the above cited sequence 
with the addition of nucleotides corresponding to Eco RV (5- , GATATC-3 l ), Nco I 
(^-CCATGGO'), Xba I (5'-TCTAGA-3'X Sma I (5*-CCCGGG-3') and Kpn I 
(S'-GGTACCO') sites at the 5* end of the primer. Primer ML52 contained to the 
exact compliment of the nucleotides corresponding to bases 1242 to 1259 of the 

35 above cited sequence with the addition of nucleotides corresponding to Sma I 
(S'-CCCGGGO*), Eco RI (5'-GAATTC-3')> Bam HI (S'-GGATCCO 1 ) and Sal I 
(5'-GTCGAC-3') sites at the 5' end of the primer. The PCR-amplified 3' end of 
the KTi3 gene was ligated into the Nco I/Eco RI sites of pML64 to create the 
plasmid pML65. A synthetic multiple cloning site linker was constructed by 
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making a dimer of complementary synthetic oligonucleotides consisting of the 
coding sequence for Pst I (5'-CTGCA-3'), Sal I (5'-GTCGAC-3*), Bam HI 
(5'-GGATCC-3') and Pst I (S'-CTGCA-T) sites. The linker was ligated into the 
Pst I site (directly 5' to the KTi3 promoter region) of pML65 to create the plasmid 
5 pML70. 

The 1.46 kb Sma I/Kpn I fragment from soybean delta- 12 desaturase 
cDNA, GmFad2-I (GenBank Acc. No. L43920), was ligated into the 
corresponding sites in pML70 resulting in the plasmid pBSlO. The desaturase 
cDNA fragment was in the reverse (antisense) orientation with respect to the KTi3 

10 promoter in pBSlO. The plasmid pBSlO was digested with Bam HI and a 3.47 kb 
fragment, representing the KTi3 promoter/anti sense desaturase cDNA/KTi3 3' end 
transcriptional unit was isolated by agarose gel electrophoresis. The vector 
pML18 consists of the non-tissue specific and constitutive cauliflower mosaic 
vims (35S) promoter (Odell et al., (1985) Nature 5/3:810-812; Hull et al., (1987) 

1 5 Virology #5:482-493), driving expression of the neomycin phosphotransferase 
gene (Beck et al. (1982) Gene 7P:327-336) followed by the 3' end of the nopaline 
synthase gene including nucleotides 848 to 1550 (Depicker et al. (1982) J. Appl 
Genet. 7:561-574). This transcriptional unit was inserted into the commercial 
cloning vector pGEM9z (Gibco-BRL) and is flanked at the 5' end of the 35S 

20 promoter by the restriction sites Sal I, Xba I, Bam HI and Sma I in that order. An 
additional Sal I site is present at the 3* end of the NOS 3' sequence and the Xba I, 
Bam HI and Sal I sites are unique. The 3.47 kb transcriptional unit released from 
pBSlO was ligated into the Bam HI site of the vector pML18 . When the resulting 
plasmids were digested with Sma I and Kpn I, plasmids containing inserts in the 

25 desired orientation yielded 3 fragments of 5.74, 2.69 and 1 .46 kb. A plasmid with 
the transcriptional unit in the correct orientation was selected and was designated 
pBS13. 

The 1 .46 kb Xbal/EcoRV fragment from pBS 1 3 (described above) was 
directionally cloned into the Smal/Xbal site of vector pCST2 (described above) to 
30 yield a plasmid designated pBS39. The 3.3 kb Hindlll fragment of plasmid 

pBS39 was cloned into the Hindlll site of plasmid pMH40 (described above) to 
give the plant expression vector pBS43 (Fig. 6). 

Transformation of soybeans with vector oBS43 and 
identification of a transgenic "Transwitch" line 
35 The vector pBS43 was transformed into soybean meristems using the 

method of particle bombardment of soybean meristems (Christou et al (1990) 
Trends Biotechnol 5:145-151). Seeds of transformed plants (i.e., from plants 
which had been identified as positive for GUS activity) were screened for fatty 
acid composition. Fatty acid methyl esters were prepared from hexane extracts of 
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small (approx. 10 mg) seed chips (Browse et al (1986) Anal Biochem. 
152: 141-145). Seed chips from ten different transgenic lines were analysed and 
some of the Rl seeds from one of these lines, designated 260-05, had a total oleic 
acid content of 80-85% compared with about 20% in control seeds. This 
5 phenotype is caused by the cosuppression of the endogenous Fad 2- 1 gene and is 
the result of the insertion of two copies of pBS43 into a locus of the soybean 
genome designated the "TranSwitch locus" (Kinney, A. J. (1995) in 'Induced 
Mutations and Molecular Techniques for Crop Improvement", International 
Atomic Energy Agency, Vienna). High oleic acid Rl seeds from line 260-05, 

1 0 which contained the TranSwitch locus, were selfed and R2 seeds which were 
homozygous for the Transwitch locus were selected. Two of these R2 
homozygous seeds (G94-1, G94-19) and seeds derived from further generations of 
G94-I and G94-19 (R3, R4, R5), were selected for further analysis. 

R5 seeds of G94-1 and G94-19 plants grown in both Iowa and Puerto Rico 

1 5 were ground into a powder and approximately 1 g extracted with 5 mL of hexane. 
After centifugation, the hexane was poured off and the flakes allowed to air dry. 
Approximately 10 mg of defatted powder was extracted as described above and 
analyzed by SDS-PAGE. In both transgenic lines derived from both locations, the 
expression of the a 1 and a subunits of p-conglycinin were suppressed relative to 

20 control soybean lines and a standard soy flour (Fig. 7). 

EXAMPLE 3 

To test if p-conglycinin expression could be suppressed using antisense 
technology, full length cDNAs of a and a' were made using reverse transcriptase 
polymerase chain reaction as described above. The upper primer, ConSa, is 

25 homologous to region 4- 1 9 of both a and a* cDNA sequences with additional 
nucleotides added to the 5' end to code for a Kpn I restriction site. The lower 
primer used, Con 1.9a, is homologous to regions 1818-1801 of SEQ ID NO:l, 
representing the a isoform, and 1920-1903 of SEQ ID NO:2, representing the a* 
isoform, respectively. To aid in subsequent cloning steps, additional nucleotides 

30 were added to the 5' end to code for an Nco I restriction site. 

ConSa 5 , -ACGGTACCGATGAGAGCGCGGTTCC-3' (SEQ ID NO:8) 
Kpn I 

Con 1 .9a 5'- A ACCCATCGTC AGTA AAAAGCCCTCAA-3' (SEQ ID NO:9) 
35 Ncoi 

Reverse transcription and subsequent PCR reaction were carried out as 
described above. RNA isolated from developing soybean seeds was reverse- 
transcribed using either random hexamers or Con 1.9a ( method as detailed above). 
40 The cDNA was amplified in a PCR reaction using ConSa and Con 1 .9a using the 
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protocol detailed above. Fifteen microliters of the PCR reaction mixes were 
analyzed by agarose gel electrophoresis. A 1.8 kb band, the expected molecular 
weight for a, was observed. The remaining reaction mixes were purified using 
Wizard™ PCR Preps DNA Purification System kit (Promega). The a cDNA, 
5 which because of the primers used included a Kpn I site on the 5' end and an Nco I 
site on the 3' end, was digested with Nco I and Kpn I restriction enzymes. The 
resulting a cDNA was gel isolated, labeled as F10, and directionally cloned 
(antisense orientation) into pC W109 using the Nco I and Kpn I sites present in the 
piasmid. F10 was confirmed as a by PCR using nested primers (upper; Con.09 
10 (SEQ ID NO:6); lower: Conl.4a (SEQ ID NO:5) and Conl.O (SEQ ID NO: 10)). 

Conl.O S'-CGGGTATGGCG AGTGTT-3 ' (SEQ ID NO:10) 

The transcriptional unit P-conglycinin promoter/a cDNA 

15 antisense/phaseolin 3' end was released from pCW109/FI0 by partial digest with 
Hind III. Conditions of the partial digest were such that 6 fragments were 
produced (5.1 kb, 3.8 kb, 3,6 kb, 2.6 kb, 2.4 kb, and 1.2 kb). The 3.6 kb fragment 
containing the the transcriptional unit was gel isolated and labeled F14. F14 was 
then cloned into the Hind III site of pKS18HH. After confirming insertion by 

20 digestion of piasmid DNA preparations made from tansformed cells with Hind III, 
the piasmid DNA from positive cultures was digested with Kpn I to ensure that 
they contained the 3.6 kb F14 fragment and not the 3.8 kb fragment from the 
partial digest of pCW109/F10 with Hind III. F14 contains a Kpn I site, while the 
3.8 kb fragment does not. Upon confirmation, pKS 1 8HH/F14 was labeled pJo3 

25 (Fig. 8). Soybean embryonic suspension cultures were transformed with pJo3 as 
detailed above. Transformation resulted in 5 transformed clones; upon maturation 
each clone gave rise to 4 to 8 somatic embryos. 

Protein extracts of transformed somatic embryos were analyzed by 
SDS-PAGE as previously detailed. Results are presented in Table 2. The 

30 transgenic clones all gave rise to at least one somatic embryo in which the 
expression of both a and a' was suppressed. 

TABLE 2 

Clone Embryo a a] P 

+ 

+ + + 

+ 
+ 

+ 

+ 



Jo3-l 
Jo3-2 
Jo3-2b 



1 
2 

1 
2 

1 

2 
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Jo3-3 1 - . + 

2 

Jo3-4 1 . . . 

2 

EXAMPLF.4 

There are five non allelic genes that code for the glycinin subunits. 
Sequencing genomic clones and cDNA's have lead to a division of the subunit 
5 genes into two groups based on sequence similarity. Group I consists of Gy 1 
(SEQ ID NO:! 1), Gy2 (SEQ ID NO:12) and Gy3 (SEQ ID NO:13), whereas 
group II consists of Gy4 (SEQ ID NO: 1 4) and Gy5 (SEQ ID NO: 15). There is 
greater than 85% similarity between genes within a group, but only 42% to 46% 
similarity between genes of different groups. To determine whether expression of 
0 glycinin can be suppressed in developing cotyledons by employing co-suppression 
technology, cDNA's of Group I and Group II were prepared using reverse 
transcriptase polymerase chain reaction as described above. 

The upper primer used for Group I reactions (GM ) is homologus to 
regions 1 - 1 9 for all Group I cDNA's. Two lower primers were used: G 1 - 1 039, 
5 which is homologous with regions 1 038- 1 022 of Gy 1 , 1 008-992 of Gy2, and 
996-980 of Gy3; or Gl-1475, which is homologus to regions 1475-1460 of Gyl, 
1445-1430 of Gy2 and 1433-1418 of Gy3. To aid in future cloning, all primers ' 
contained additional nucleotides that coded for a Not I restriction site at their 5' 
end. 

Gl-1 5'-GCQGCCGCATGGCCAAGCTAGTTTTTT-3' (SEQ ID N(M6) 

Not! " ' 

01-1039 5'-QCGGCCGCTGGTGGCGTTTGTGA-3' (SEQ ID NO- 17) 
Net I ' ' 

Gl-1475 5'-GCGGCCGCTCTTCTGAGACTCCT-3' (SEQ ID NO' 18) 

Not I * ' 

RNA isolated from developing soybean seeds was reverse-transcribed 
using either random hexamers, or Gl-1475 or GI -1039 as the lower primer in the 
reactions. cDNA fragments were amplified using a mixture of G I - 1 with either 
Gl-1 039 or Gl-1475. Fifteen microliters of the PCR reaction mixes were 
analyzed by agarose gel electrophoresis. PCR reactions resulted in products of the 
expected molecular wieght, approximately 1 kb and 1 .4-1.5 kb for primer sets 
Gl-l/Gl-1039 and Gl-I/Gl-1475, respectively. cDNA fragments from the 
remainder of the reaction mixes were purified using the Wizard™ PCR Preps DNA 
Purification System kit (Promega). Purified cDNA's were then digested with 
Not I and isolated by agarose gel purification. 
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The upper primer used for RT-PCR reactions of Group II (G4-7) is 
homologus to regions 7-22 for both cDNA's of Group II. Two lower primers were 
used: G4-1251 which is homologus with regions 1251-1234 of Gy4 and 
1 153-1 135 of Gy5; or G4-1670 which is homologus to regions 1668-1653 of Gy4. 
5 There is no similar region in Gy5. To aid in future cloning all primers contained 
additional nucleotides that coded for a Not I restriction site at their 5' end. 



G4-7 5'- GCGGCCGCA TGCCCTTrArTrTrT.r (SEQ ID NO: 1 9) 
Not! 

10 G4- 1 25 1 5'-GCGGCCGCTGGGAGGGTGAGGCTGTT-3' (SEQ ID NO:20) 

Notl 

G4-1670 S'-GCGGCCGCTGAGCCTTGTTGAGACO' (SEQ ID NO:21) 
Notl 

1 5 RNA isolated from developing soybean seeds was reverse-transcribed 

using either random hexamers, or G4-1251 or G4-1670 as the lower primer in the 
reactions. cDNA fragments were amplified using a mixture of G4-7 with either 
G4- 1 25 1 or G4- 1 670. Fifteen microliters of the PGR reaction mixes were 
analyzed by agarose gel electrophoresis. PGR reactions resulted in products of the 

20 expected molecular weight, approximately i .25 kb and 1 .7 kb for primer sets 
G4-7/G4- 1 25 1 and G4-7/G4- 1 6.70, respectively. cDNA fragments from the 
remainder of the reaction mixes were purified using the Wizard™ PCR Preps DNA 
Purification System kit (Promega). Purified cDNA's were then digested with 
Not I and isolated from gels. 

25 The isolated group I cDNAs are cloned into pRB20 (Fig. 9) at the Not I 

site (sense oritentation). After partial restriction digest with Not I and isolation of 
the single cut pRB20/group I linear fragments, group II cDNA are added to create 
final transcriptional units [3-conglycinin promoter/group I cDNA (sense 
orientation)/phaseolin 3* end and P-conglycinin promoter/group II cDNA (sense 

30 orientation)/phaseolin 3' end. The resulting plasmids are then used to transform 
somatic embryonic suspension cultures using the method detailed above. 
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(1} GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: E. I. DU PONT DE NEMOURS AND COMPANY 

(B) STREET: 1007 MARKET STREET 

(C) CITY: WILMINGTON 

(D) STATE: DELAWARE 

(E) COUNTRY: UNITED STATES OF AMERICA 

(F) ZIP: 19898 

(G) TELEPHONE: 302-992-5481 

(H) TELEFAX: 302-773-0164 
{I) TELEX: 6717325 

(ii) TITLE OF INVENTION: SUPPRESSION OF SPECIFIC 

CLASSES OF SOYBEAN SEED 
PROTEIN GENES 

(iii) NUMBER OF SEQUENCES: 21 

(iv> COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: DISKETTE, 3.50 INCH 

(B) COMPUTER: IBM PC COMPATIBLE 

(C) OPERATING SYSTEM: MICROSOFT WORD FOR WINDOWS 95 

(D) SOFTWARE: MICROFSOFT WORD 7.0 

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/019,940 

(B) FILING DATE: JUNE 14,1996 

(vii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: LYNNE M. CHRISTENBURY 

(B) REGISTRATION NUMBER: 30,971 

tC) REFERENCE/DOCKET NUMBER: BB-1071 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1818 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
ATGATGAGAG CACGGTTCCC ATTACTGTTG CTGGGACTTG TTTTCCTGGC TTCAGTTTCT 60 
GTCTCATTTG GCATTGCTTA CTGGGAAAAA GAGAACCCCA AACACAACAA GTGTCTCCAG 120 
AGTTGCAATA GCGAGAGAGA CTCGTACAGG AACCAAGCAT GCCACGCTCG TTGCAACCTC 180 
CTTAAGGTGG AGAAAGAAGA ATGTGAAGAA GGTGAAATTC CACGACCACG ACCACGACCA 240 
CAACACCCGG AGAGGGAACC TCAGCAACCC GGTGAGAAGG AGGAAGACGA AGATGAGCAA 300 
CCACGTCCAA TCCCATTCCC ACGCCCACAA CCTCGTCAAG AAGAAGAGCA CGAGCAGAGA 3 60 
GAGGAACAGG AATGGCCTCG CAAGGAGGAA AAACGCGGAG AAAAGGGAAG TGAAGAGGAA 420 
GAT GAG GAT G AGGATGAGGA ACAAGATGAA CGTCAATTCC CATTCCCACG CCCACCTCAT 4 80 
CAGAAGGAAG AGCGAAACGA AGAGGAAGAT GAGGATGAGG AGCAGCAGCG AGAGAGCGAA 54 0 
GAAAGTGAAG ATTCTGAGTT ACGAAGACAT AAGAATAAGA ACCCTTTTCT CTTCGGCTCT 600 
AACAGGTTCG AAACTCTCTT CAAAAACCAA TATGGTCGCA TTCGCGTCCT CCAGAGGTTC 660 
AACCAACGCT CCCCACAACT TCAGAATCTC CGAGACTACC GCATTTTGGA GTTCAACTCC 720 
AAACCCAACA CCCTCCTTCT CCCCAACCAT GCTGACGCTG ATTACCTCAT CGTTATCCTT 780 
AACGGGACTG CCATTCTTTC CTTGGTGAAC AACGACGACA GAGACTCCTA CAGACTTCAA 84 0 
TCTGGTGATG CCCTGAGAGT CCCCTCAGGA ACCACATACT ATGTGGTCAA CCCTGACAAC 900 
AACGAAAATC TCAGATTAAT AACACTCGCC ATACCCGTTA ACAAGCCTGG TAGATTTGAG 960 
AGTTTCTTCC TATCTAGCAC TGAAGCTCAA CAATCCTACT TGCAAGGATT 'CAGCAGGAAC 1020 
ATTTTAGAGG CCTCCTACGA TACCAAATTC GAGGAGATAA ACAAGGTTCT GTTTAGTAGA 1080 
GAGGAAGGGC AGCAGCAAGG GGAGCAGAGG CTGCAAGAGA GCGTGATTGT GGAAATCTCG 114 0 
AAGGAACAGA TTCGGGCACT GAGCAAACGT GCCAAATCTA GTTCAAGGAA AACCATTTCT 1200 
TCTGAAGATA AACCTTTTAA CTTGAGAAGC CGCGACCCCA TCTACTCCAA CAAGCTTGGC 1260 
AAGTTCTTTG AGATCACCCC AGAGAAAAAC CCCCAGCTTC GGGACTTGGA TATCTTCCTC 1320 
AGTATTGTGG ATATGAACGA GGGAGCTCTT CTTCTACCAC ACTTCAATTC AAAGGCGATA 1380 
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GTGATACTGG TAATTAATGA AGGAGATGCA AACATTGAAC TTGTTGGCCT AAAAGAACAA 1440 

CAACAGGAGC AGCAACAGGA AGAGCAACCT TTGGAAGTGC GGAAATATAG AGCCGAATTG 1500 

TCTGAACAAG ATATATTTGT AATCCCAGCA GGTTATCCAG TTGTGGTCAA CGCTACCTCA 1560 

AATCTGAATT TCTTTGCTAT TGGTATTAAT GCCGAGAACA ACCAGAGGAA CTTCCTCGCA 1620 

GGTTCGCAAG. ACAATGTGAT AAGCCAGATA CCTAGTCAAG TGCAGGAGCT TGCATTCCCT 1680 

GGGTCTGCAC AAGCTGTTGA GAAGCTATTA AAGAACCAAA GAGAATCCTA CTTTGTGGAT 17 4 0 

GCTCAGCCTA AGAAGAAAGA GGAGGGGAAT AAGGGAAGAA AGGGTCCTTT GTCTTCAATT 1800 

TTGAGGGCTT TTTACTGA 1818 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1920 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

ATGATGAGAG CGCGGTTCCC ATTACTGTTG CTGGGAGTTG TTTTCCTAGC ATCAGTTTCT 60 

GTCTCATTTG GCATTGCGTA TTGGGAAAAG CAGAACCCCA GTCACAACAA GTGCCTCCGA 120 

AGTTGCAATA GCGAGAAAGA CTCCTACAGG AACCAAGCAT GCCACGCTCG TTGCAACCTC 180 

CTTAAGGTGG AGGAAGAAGA AGAATGCGAA GAAGGTCAAA TTCCACGACC ACGACCACAA 24 0 

CACCCGGAGA GGGAACGTCA GCAACACGGT GAGAAGGAGG AAGACGAAGG TGAGCAGCCA 300 

CGTCCATTCC CATTCCCACG CCCACGCCAA CCTCATCAAG AGGAAGAGCA CGAGCAGAAG 360 

GAGGAACACG AATGGCATCG CAAGGAGGAA AAACACGGAG GAAAGGGAAG TGAAGAGGAA 420 

CAAGATGAAC GTGAACACCC ACGCCCACAC CAACCTCATC AAAAGGAAGA GGAAAAGCAC 4 80 

GAATGGCAAC ACAAGCAGGA AAAGCACCAA GGAAAGGAAA GTGAAGAAGA AGAAGAAGAC 540 

CAAGACGAGG ATGAGGAGCA AGACAAAGAG AGCCAAGAAA GTGAAGGTTC TGAGTCTCAA 600 

AGAGAACCAC GAAGACATAA GAATAAGAAC CCTTTTCACT TCAACTCTAA AAGGTTCCAA 660 

ACTCTCTTCA AAAACCAATA TGGCCACGTT CGCGTCCTCC AGAGGTTCAA CAAACGCTCC 720 

CAACAGCTTC AGAATCTCCG AGACTACCGC ATTTTGGAGT TCAACTCCAA ACCCAACACC 780 

CTTCTTCTCC CCCACCATGC TGACGCTGAT TACCTCATCG TTATCCTTAA CGGGACTGCC 840 

ATTCTTACCT TGGTGAACAA CGACGACCGA GACTCTTACA ACCTTCAATC TGGCGATGCC 900 
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CTAAGAGTCC CTGCAGGAAC CACATTCTAT G7GGTTAACC CTGACAACGA CGAGAATCTC 960 

AGAATGATAG CAGGAACCAC ATTCTATGTG GTTAACCCTG ACAACGACGA GAATCTCAGA 1020 

ATGATAACAC TCGCCATACC CGTTAACAAA CCCGGTAGAT TTGAGAGTTT CTTCCTATCT 1080 

AGCACTCAAG CTCAACAGTC CTACTTGCAA GGGTTCAGCA AGAATATTCT AGAGGCCTCA 1140 

TACGACACCA AATTCGAGGA GATAAACAAG GTTCTGTTTG GTAGAGAGGA GGGGCAGCAA 1200 

CAAGGGGAGG AGAGGCTGCA AGAGAGTGTG ATTGTGGAAA TCTCAAAGAA ACAAATTCGG 1260 

GAACTGAGCA AACATGCCAA ATCTAGTTCA AGGAAAACCA TTTCTTCTGA AGATAAACCT 1320 

TTCAACTTGG GAAGCCGCGA CCCCATCTAT TCCAACAAGC TTGGCAAGTT GTTTGAGATT 1380 

ACCCAGAGAA ACCCTCAGCT TCGGGACTTG GATGTCTTCC TCAGTGTTGT GGATATGAAC 14 4 0 

GAGGGAGCTC TTTTTCTACC ACACTTCAAT TCAAAGGCCA TAGTGGTACT AGTGATTAAT 1500 

GAAGGAGAAG CAAACATTGA ACTTGTTGGC ATTAAAGAAC AACAACAGAG GCAGCAACAG 1560 

GAAGAGCAAC CTTTGGAAGT GCGGAAATAT AGAGCTGAAT TGTCTGAACA AGATATATTT 1620 

GTAATCCCAG CAGGTTATCC AGTTATGGTC AACGCTACCT CAGATCTGAA TTTCTTTGCT 1680 

TTTGGTATCA ATGCCGAGAA CAACCAGAGG AACTTCCTTG CAGGTTCGAA AGACAATGTG 174 0 

ATAAGCCAGA TACCTAGTCA AGTGCAGGAG CTTGCGTTCC CTAGGTCTGC AAAAGATATT 1800 

GAGAACCTAA TAAAGAGCCA AAGTGAGTCC TACTTTGTGG ATGCTCAGCC TCAGCAGAAA 1860 

GAGGAGGGGA ACAAGGGAAG AAAGGGTCCT TTGTCTTCAA TTTTGAGGGC TTTTTACTGA 1920 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1320 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATGATGAGAG TGCGGTTTCC TTTGTTGGTG TTGCTGGGAA CTGTTTTCCT GGCATCAGTT 60 

TGTGTCTCAT TAAAGGTGAG AGAGGATGAG AATAACCCTT TCTACTTTAG AAGCTCTAAC 120 

AGCTTCCAAA CTCTCTTTGA GAACCAAAAC GTTCGCATTC GTCTCCTCCA GAGATTCAAC 180 

AAACGCTCCC CACAACTTGA GAACCTTCGA GACTACCGGA TTGTCCAGTT TCAGTCAAAA 240 

CCCAACACAA TCCTTCTCCC CCACCATGCT GACGCCGATT TCCTCCTCTT TGTCCTTAGC 300 

GGGAGAGCCA TACTTACCTT GGTGAACAAC GACGACAGAG ACTCCTACAA CCTTCACCCT 360 
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GGCGATGCCC 


AGAGAATCCC 


AGCTGGAACC 


ACTTACTATT 


TGGTTAACCC 


TCACGACCAC 


420 


CAGAATCTCA 


AAATAATCAA 


ACTTGCCATA 


CCCGTCAACA 


AACCTGGCAG 


ATATGATGAT 


4 80 

*f O 'J 


TTCTTCTTAT 


CTAGCACTCA 


AGCCCAACAG 


TCCTACTTGC 


AAGGCTTCAG 


CCATAATATT 


54 0 
j *i \j 


CTAGAGACCT 


CCTTCCATAG 


CGAATTCGAG 


GAGATAAACA 


GGGTTTTGTT 


TGGAGAGGAA 


600 


GAGGAGCAGA 


GGCAGCAAGA 


GGGAGTGATC 


GTGGAACTCT 


CAAAGGAACA 


AATTCGGCAA 


660 


CTGAGCAGAC 


GTGCCAAATC 


TAGTTCAAGG 


AAAACCATTT 


CCTCCGAAGA 


TGAACCATTC 


720 


AACTTGAGAA 


GCCGCAACCC 


CATCTATTCC 


AACAACTTTG 


GAAAGTTCTT 


TGAGATCACC 


780 


CCTGAGAAAA 


ACCCACAGCT 


TCGGGACTTG 


GATATCTTCC 


TCAGTTCTGT 


GGATATCAAC 


640 


GAAGGAGCTC 


TTCTTCTACC 


ACACTTCAAT 


TCAAAGGCCA 


TAGTGATACT 


AGTGATTAAT 


900 


GAAGGAGATG 


CAAACATTGA ACTTGTTGGC 


ATTAAAGAAC 


AACAACAGAA 


GCAGAAACAG 


960 


GAAGAGGAAC 


CTTTGGAAGT 


GCAAAGGTAC 


AGAGCTGAAT 


TGTCTGAAGA 


CGATGTATTT 


1020 


GTAATTCCAG 


CAGCTTATCC 


ATTTGTCGTC 


AACGCTACCT 


CAAACCTCAA 


TTTCCTTGCT 


1080 


TTTGGTATCA 


ATGCTGAGAA 


CAACCAGAGG 


AACTTCCTTG 


CAGGCGAGAA 


AGACAATGTG 


1140 


GTAAGGCAGA 


TAGAAAGACA 


AGTGCAGGAG 


CTTGCGTTCC 


CTGGGTCTGC 


ACAAGATGTT 


1200 


GAGAGGCTAT 


TAAAGAAGCA 


GAGGGAATCC 


TACTTTGTTG 


ATGCTCAGCC 


TCAGCAGAAG 


1260 


GAGGAGGGGA 


GTAAGGGAAG 


AAAGGGTCCT 


TTTCCTTCAA 


TCTTAGGTGC 


TCTCTACTGA 


1320 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CGTACCATGG TGAGAGCGCG GTTCC 25 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



32 



WO 97/47731 

(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 5 
CGGTACCGAA TTGAAGTGTG GTAG 
£2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 

TCGTCCATGG AGCGCGGTTC CCATTAC 

(2) INFORMATION FOR SEQ ID NO; 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7 
TCTCGGTCGT CGTTGTT 
(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8 
ACGGTACCGA TGAGAGCGCG GTTCC 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 9: 

AACCCATGGT CAGTAAAAAG CCCTCAA 2 7 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(iij MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CGGGTATGGC GAGTGTT 17 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1488 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ATGGCCAAGC TAGTTTTTTC CCTTTGTTTT CTGCTTTTCA GTGGCTGCTG CTTCGCTTTC 60 

AGTTCCAGAG AGCAGCCTCA GCAAAACGAG TGCCAGATCC AAAAACTCAA TGCCCTCAAA 120 

CCGGATAACC GTATAGAGTC AGAAGGAGGG CTCATTGAGA CATGGAACCC TAACAACAAG 180 

CCATTCCAGT GTGCCGGTGT TGCCCTCTCT CGCTGCACCC TCAACCGCAA CGCCCTTCGT 24 0 

AGACCTTCCT ACACCAACGG TCCCCAGGAA ATCTACATCC AACAAGGTAA GGGTATTTTT 300 

GGCATGATAT ACCCGGGTTG TCCTAGCACA TTTGAAGAGC CTCAACAACC TCAACAAAGA 360 

GGACAAAGCA GCAGACCACA AGACCGTCAC CAGAAGATCT ATAACTTCAG AGAGGGTGAT 420 

TTGATCGCAG TGCCTACTGG TGTTGCATGG TGGATGTACA ACAATGAAGA CACTCCTGTT 4 80 

GTTGCCGTTT CTATTATTGA CACCAACAGC TTGGAGAACC AGCTCGACCA GATGCCTAGG 54 0 

AGATTCTATC TTGCTGGGAA CCAAGAGCAA GAGTTTCTAA AATATCAGCA AGAGCAAGGA 600 

GGTCATCAAA GCCAGAAAGG AAAGCATCAG CAAGAAGAAG AAAACGAAGG AGGCAGCATA 660 

TTGAGTGGCT TCACCCTGGA ATTCTTGGAA CATGCATTCA GCGTGGACAA GCAGATAGCG 720 

AAAAACCTAC AAGGAGAGAA CGAAGGGGAA GACAAGGGAG CCATTGTGAC AGTGAAAGGA 780 

GGTCTGAGCG TGATAAAACC ACCCACGGAC GAGCAGCAAC AAAGACCCCA GGAAGAGGAA 84 0 
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GAAGAAGAAG AGGATGAGAA GCCACAGTGC AAGGGTAAAG ACAAACACTG CCAACGCCCC 900 

CGAGGAAGCC AAAGCAAAAG CAGAAGAAAT GGCATTGACG AGACCATATG CACCATGAGA 960 

CTTCGCCACA ACATTGGCCA GACTTCATCA CCTGACATCT ACAACCCTCA AGCCGGTAGC 1020 

GTCACAACCG CCACCAGCCT TGACTTCCCA GCCCTCTCGT GGCTCAGACT CAGTGCTGAG 1080 

TTTGGATCTC TCCGCAAGAA TGCAATGTTC GTGCCACACT ACAACCTGAA CGCGAACAGC 1140 

ATAATATACG CATTGAATGG ACGGGCATTG ATACAAGTGG TGAATTGCAA CGGTGAGAGA 1200 

GTGTTTGATG GAGAGCTGCA AGAGGGACGG GTGCTGATCG TGCCACAAAA CTTTGTGGTG 12 60 

GCTGCAAGAT CACAGAGTGA CAACTTCGAG TATGTGTCAT TCAAGACCAA TGATACACCC 1320 

ATGATCGGCA CTCTTGCAGG GGCAAACTCA TTGTTGAACG CATTACCAGA GGAAGTGATT 1380 

CAGCACACTT TCAACCTAAA AAGCCAGCAG GCCAGGCAGA TAAAGAACAA CAACCCTTTC 14 40 

AAGTTCCTGG TTCCACCTCA GGAGTCTCAG AAGAGAGCTG TGGCTTAG 14 88 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1458 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

ATGGCCAAGC TTGTTCTTTC CCTTTGTTTC CTTCTTTTCA GTGGCTGCTT CGCTCTGAGA 60 

GAGCAGGCAC AGCAAAATGA GTGCCAGATC CAAAAGCTGA ATGCCCTCAA ACCGGATAAC 120 

CGTATAGAGT CGGAAGGTGG GTTCATTGAG ACATGGAACC CTAACAACAA GCCATTCCAG 180 

TGTGCCGGTG TTGCCCTCTC TCGCTGCACC CTTAACCGCA ATGCCCTTCG TAGACCTTCC 24 0 

TACACCAACG GTCCCCAGGA AATCTACATA CAACAAGGTA ATGGTATTTT TGGCATGATA 300 

TTCCCGGGTT GTCCTAGCAC TTATCAAGAG CCGCAAGAAT CTCAGCAACG AGGACGAAGC 360 

CAGAGGCCCC AAGACCGTCA CCAAAAGGTA CATCGCTTCA GAGAGGGTGA TTTGATCGCA 4 20 

GTGCCTACTG GTGTTGCATG GTGGATGTAC AACAATGAAG ACACTCCTGT TGTTGCCGTT 480 

TCTATTATTG ACACCAACAG CTTGGAGAAC CAGCTCGACC AGATGCCTAG GAGATTCTAT 54 0 

CTTGCTGGGA ACCAAGAGCA AGAGTTTCTA AAATATCAGC AGCAGCAGCA AGGAGGTTCC 600 

CAAAGCCAGA AAGGAAAGCA ACAAGAAGAA GAAAACGAAG GAAGCAACAT ATTGAGTGGC 660 

TTCGCCCCTG AATTCTTGAA AGAAGCGTTC GGCGTGAACA TGCAGATAGT GAGAAACCTA 720 
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CAAGGTGAGA ACGAAGAGGA GGATAGTGGA GCCATTGTGA CAGTGAAAGG AGGTCTAAGA 780 

GTCACAGCTC CAGCCATGAG GAAGCCACAG CAAGAAGAAG ATGATGATGA TGAGGAAGAG 84 0 

CAGCCACAGT GCGTGGAGAC AGACAAAGGT TGCCAACGCC AAAGCAAAAG GAGCAGAAAT 900 

GGCATTGATG AGACCATTTG CACAATGAGA CTTCGCCAAA ACATTGGTCA GAATTCATCA 960 

CCTGACATCT ACAACCCTCA AGCTGGTAGC A7CACAACCG CCACCAGCCT TGACTTCCCA 1020 

GCCCTCTGGC TTCTCAAACT CAGTGCCCAG TATGGATCAC TCCGCAAGAA TGCTATGTTC 1080 

GTGCCACACT ACACCCTGAA CGCGAACAGC ATAATATACG CATTGAATGG GCGGGCATTG 1140 

GTACAAGTGG TGAATTGCAA TGGTGAGAGA GTGTTTGATG GAGAGCTGCA AGAGGGAGGG 1200 

GTGCTGATCG TTCCACAAAA CTTTGCGGTG GCTGCAAAAT CCCAGAGCGA TAACTTTGAG 1260 

TATGTGTCAT TCAAGACCAA TGATAGACCC TCGATCGGAA ACCTTGCAGG GGCAAACTCA 1320 

TTGTTGAACG CATTGCCAGA GGAAGTGATT CAGCACACTT TTAACCTAAA GAGCCAGCAG 1380 

GCCAGGCAGG TGAAGAACAA CAACCCTTTC AGCTTCCTTG TTCCACCTCA GGAGTCTCAG 14 4 0 

AGGAGAGCTG TGGCTTAG 1458 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 4 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



ATGGCTAAGC 


TTGTTCTTTC 


CCTTTGTTTT 


CTGCTTTTCA 


GTGGCTGCTG 


CTTCGCTTTC 


60 


AGTTTCAGAG 


AGCAGCCACA 


GC AAAAC GAG 


TGCCAGATCC 


AACGCCTCAA 


TGCCCTAAAA 


120 


CCGGATAACC 


GTATAGAGTC 


AGAAGGTGGC 


TTCATTGAGA CATGGAACCC 


TAACAACAAG 


180 


CCATTCCAGT 


GTGCCGGTGT 


TGCCCTCTCT 


CGCTGCACCC 


TCAACCGCAA 


CGCCCTTCGC 


240 


AGACCTTCCT 


ACACCAACGC 


TCCCCAGGAG 


ATCTACATCC 


AACAAGGTAG 


TGGTATTTTT 


300 


GGCATGATAT 


TCCCGGGTTG 


TCCTAGCACA 


TTTGAAGAGC 


CTCAACAAAA 


AGGACAAAGC 


360 


AGCAGGCCCC 


AAGACCGTCA 


CCAGAAGATC 


TATCACTTCA 


GAGAGGGTGA 


TTTGATTGCA 


420 


GTGCCAACCG 


GTTTTGCATA 


CTGGATGTAC 


AACAATGAAG 


ACACTCCTGT 


TGTTGCCGTT 


480 


TCTCTTATTG 


ACACCAACAG 


CTTCCAGAAC 


CAGCTCGACC 


AGATGCCTAG 


GAGATTCTAT 


540 


CTTGCTGGGA 


ACCAAGAGCA 


AGAGTTTCTA 


CAGTATCAGC 


CACAGAAGCA 


GCAAGGAGGT 


600 
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ACTCAAAGCC AGAAAGGAAA GCGTCAGCAA GAAGAAGAAA ACGAAGGAGG CAGCATATTG 660 

AGTGGCTTCG CCCCGGAATT CTTGGAACAT GCGTTCGTCG TGGACAGGCA GATAGTGAGA 720 

AAGCTACAAG GTGAGAACGA AGAGGAAGAG AAGGGTGCCA TTGTGACAGT GAAAGGAGGT 780 

CTCAGCGTGA TAAGCCCACC CACGGAAGAG CAGCAACAAA GACCCGAGGA AGAGGAGAAG 84 0 

CCAGATTGTG ACGAGAAAGA CAAACATTGC CAAAGCCAAA GCAGAAATGG CATTGACGAG 900 

ACCATTTGCA CAATGAGACT TCGCCACAAC ATTGGCCAGA CTTCATCACC TGACATCTTC 960 

AACCCTCAAG CTGGTAGCAT CACAACCGCT ACCAGCCTCG ACTTCCCAGC CCTCTCGTGG 1020 

CTCAAACTCA GTGCCCAGTT TGGATCACTC CGCAAGAATG CTATGTTCGT GCCACACTAC 1080 

AACCTGAACG CAAACAGCAT AATATACGCA TTGAATGGAC GGGCATTGGT ACAAGTGGTG 1140 

AATTGCAATG GTGAGAGAGT GTTTGATGGA GAGCTGCAAG AGGGACAGGT GTTAATTGTG 1200 

CCACAAAACT TTGCGGTGGC TGCAAGATCA CAGAGCGACA ACTTCGAGTA TGTTTCATTC 1260 

AAGACCAATG ATAGACCCTC GATCGGCAAC CTTGCAGGTG CAAACTCATT GTTGAACGCA 1320 

TTGCCGGAGG AAGTGATTCA GCAAACTTTT AACCTAAGGA GGCAGCAGGC CAGGCAGGTC 1380 

AAGAACAACA ACCCTTTCAG CTTCCTGGTT CCACCTAAGG AGTCTCAGAG GAGAGTTGTG 14 40 

GCTTAG 14 4 6 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 168 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

ATGGGGAAGC CCTTCACTCT CTCTCTTTCT TCCCTTTGCT TGCTACTCTT GTCGAGTGCA 60 

TGCTTTGCTA TTAGCTCCAG CAAGCTCAAC GAGTGCCAAC TCAACAACCT CAACGCGTTG 120 

GAACCCGACC ACCGCGTTGA GTTCGAAGGT GGTTTGATTC AAACATGGAA CTCTCAACAC 180 

CCTGAGCTGA AATGCGCCGG TGTCACTGTT TCCAAACTCA CCCTCAACCG CAATGGCCTC 240 

CACTTGCCAT CTTACTCACC TTATCCCCGG ATGATCATCA TCGCCCAAGG GAAAGGAGCA 300 

CTGCAGTGCA AGCCAGGATG TCCTGAGACG TTTGAGGAGC CACAAGAACA ATCAAACAGA 360 

AGAGGCTCAA GGTCGCAGAA GCAGCAGCTA CAGGACAGTC ACCAGAAGAT TCGTCACTTC 420 

AATGAAGGAG ACGTACTCGT GATTCCTCCT GGTGTTCCTT ACTGGACCTA TAACACTGGC 4 80 
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GATGAACCAG TTGTTGCCAT CAGTCTTCTT GACACCTCTA ACTTCAATAA CCAGCTTGAT 54 0 

CAAACCCCTA GGGTATTTTA CCTTGCTGGG AACCCAGA7A TAGAGTACCC AGAGACCATG 600 
CAACAACAAC AACAGCAGAA AAGTCATGGT GGACGCAAGC AGGGGCAACA CCAGCAGGAG 660 
GAAGAGGAAG AAGGTGGCAG CGTGCTCAGT GGCTTCAGCA AACACTTCTT GGCACAATCC 720 
TTCAACACCA ACGAGGACAT AGCTGAGAAA CTTCAGTCTC CAGACGACGA AAGGAAGCAG 780 
ATCGTGACAG TGGAAGGAGG TCTCAGCGTT ATCAGCCCCA AGTGGCAAGA ACAACAAGAT 84 0 
GAAGATGAAG ATGAAGACGA AGATGATGAA GATGAACAAA TTCCCTCTCA CCCTCCTCGC 900 
CGACCAAGCC ATGGAAAGCG TGAACAAGAC GAGGACGAGG ACGAAGATGA AGATAAACCT 960 
CGTCCTAGTC GACCAAGCCA AGGAAAGCGT GAACAAGACC AGGACCAGGA CGAGGACGAA 1020 
GATGAAGATG AAGATCAACC TCGCAAGAGC CGCGAATGGA GATCGAAAAA GACACAACCC 1080 
AGAAGACCTA GACAAGAAGA ACCACGTGAA AGAGGATGCG AGACAAGAAA CGGGGTTGAG 114 0 
GAAAATATCT GCACCTTGAA GCTTCACGAG AACATTGCTC GCCCTTCACG CGCTGACTTC 1200 
TACAACCCTA AAGCTGGTCG CATTAGTACC CTCAACAGCC TCACCCTCCC AGCCCTCCGC 1260 
CAATTCCAAC TCAGTGCCCA ATATGTTGTC CTCTACAAGA ATGGAATTTA CTCTCCACAT 1320 
TGGAATCTGA ATGCAAACAG TGTGATCTAT GTGACTCGAG GACAAGGAAA GGTTAGAGTT 1380 
GTGAACTGCC AAGGGAATGC AGTGTTCGAC GGTGAGCTTA GGAGGGGACA ATTGCTGGTG 14 40 
GTACCACAGA ACTTCGTGGT GGCGGAGCAA GCCGGAGAAC AAGGATTCGA ATACATAGTA 1500 
TTCAAGACAC ACCACAACGC AGTCACTAGC TACTTGAAGG ATGTGTTTAG GGCAATTCCC 1560 
TCAGAGGTTC TTGCCCATTC TTACAACCTT CGACAGAGTC AAGTGTCTGA GCTTAAGTAT 1620 
GAAGGAAATT GGGGTCCTTT GGTCAACCCT GAGTCTCAAC AAGGCTCACC CCGTGTTAAA 1680 
GTCGCATAA 1689 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1551 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
{D} TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
ATGGGGAAGC CCTTCTTCAC TCTCTCTCTT TCTTCCCTTT GCTTGCTACT CTTGTCGAGT 60 
GCATGCTTTG CTATTACCTC CAGCAAGTTC AACGAGTGCC AACTCAACAA CCTCAACGCG 120 
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TTGGAACCCG ACCACCGCGT 


TGAGTCCGAA 


GGTGGTCTTA 


. TTGAAACATG 


PCT/US97/09743 

GAACTCTCAA lfln 


CACCCTGAGC 


TGCAATGCGC 


CGGTGTCACT 


GTTTCCAAAC 


GCACCCTCAA 


CCGCAACGGC 


240 


TCCCACTTGC 


CATCTTACTT 


ACCTTATCCC 


CAAATGATCA 


TTGTCGTTCA 


AGGGAAGGGA 


300 


GCAATTGGAT 


TTGCATTTCC 


GGGATGTCCC 


GAGACGTTTG 


AGAAGCCACA 


ACAACAATCA 


360 


AGCAGAAGAG 


GCTCAAGGTC 


ACAGCAGCAA 


CTACAAGACA 


GTCACCAGAA 


GATTCGTCAC 


420 


TTCAATGAAG 


GAGACGTACT 


AGTGATTCCT 


CTTGGTGTTC 


CTTACTGGAC 


CTATAACACT 


480 


GGCGATGAAC 


CAGTTGTTGC 


CATCAGTCCT 


CTTGACACCT 


CCAACTTCAA 


CAATCAGCTT 


540 


GATCAAAACC 


CCAGAGTATT 


TTACCTTGCT 


GGGAACCCAG 


ATATAGAGCA 


TCCCGAGACC 


600 


AT GC AACAAC 


AGCAGCAGCA 


GAAGAGTCAT 


GGTGGACGCA 


AGCAGGGGCA 


ACACCGACAG 


660 


CAGGAGGAAG 


AAGGTGGCAG 


TGTGCTCAGT 


GGCTTCAGCA 


AACATTTCTT 


AGCACAATCC 


720 


TTCAACACCA 


ACGAGGACAC 


AGCTGAGAAA 


CTTCGGTCTC 


CAGATGACGA 


AAGGAAGCAG 


780 


ATCGTGACAG 


TGGAGGGAGG 


CCTCAGCGTT 


ATCAGCCCCA 


AGTGGCAAGA 


ACAAGAAGAC 


840 


GAAGACGAAG 


ACGAAGACGA 


AGAATATGGA 


CGGACGCCCT 


CTTATCCTCC 


ACGACGACCA 


900 


AGCCATGGAA 


AGCATGAAGA 


TGACGAGGAC 


GAGGACGAAG 


AAGAAGATCA 


ACCTCGTCCT 


960 


GATCACCCTC 


CACAGCGACC 


AAGCAGGCCC 


GAACAACAAG 


AACCACGTGG 


AAGAGGATGT 


1020 


CAGACTAGAA 


ATGGGGTTGA 


GGAAAATATT 


TGCACCATGA 


AGCTTCACGA 


GAACATTGCT 


1080 



CGCCCTTCAC GTGCTGACTT CTACAACCCA AAAGCTGGTC GCATTAGCAC CCTCAACAGT 1140 

CTCACCCTCC CAGCCCTCCG CCAATTCGGA CTCAGTGCCC AATATGTTGT CCTCTACAGG 1200 

AATGGAATTT ACTCTCCAGA TTGGAACTTG AACGCGAACA GTGTGACGAT GACTCGAGGG 1260 

AAAGGAAGAG TTAGAGTGGT GAACTGCCAA GGGAATGCAG TGTTCGACGG TGAGCTAAGG 1320 

AGGGGACAAT TGCTAGTGGT GCCGCAGAAC CCCGCGGTGG CTGAGCAAGG GGGAGAACAA 1380 

GGATTGGAAT ATGTAGTGTT CAAGACACAC CACAACGCCG TGAGCAGCTA CATTAAGGAT 14 40 

GTGTTTAGGG TAATCCCTTC GGAGGTTCTT TCCAATTCTT ACAACCTTGG CCAGAGTCAA 1500 

GTGCGTCAGC TCAAGTATCA AGGAAACTCC GGCCCTTTGG TCAACCCATA A 1551 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 16 
GCGGCCGCAT GGCCAAGCTA GTTTTTT 
(2) INFORMATION FOR SEQ ID NO: 17: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA [genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GCGGCCGCTG GTGGCGTTTG TGA 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: IB: 
GCGGCCGCTC TTCTGAGACT CCT 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GCGGCCGCAT GCCCTTCACT CTCT 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
GCGGCCGCTG GGAGGGTGAG GCTGTT 
(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GCGGCCGCTG AGCCTTGTTG AGAC 
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CLAIMS 

I . A method for reducing the quantity of a soybean seed storage protein 
in soybean seeds comprising: 

(a) constructing a chimeric gene comprising: 

5 (i) a nucleic acid fragment encoding a promoter that is 

functional in the cells of soybean seeds; 
(ii) a nucleic acid fragment encoding all or a portion of a 
soybean seed storage protein placed in sense or 
antisense orientation relative to the promoter of (i); and 
10 (iii) a transcriptional termination region; 

(b) creating a transgenic soybean cell by introducing into a soybean 
cell the chimeric gene of (a); and 

(c) growing the transgenic soybean cells of step (b) under 
conditions that result in expression of the chimeric gene of 

15 step (a) 

wherein the quantity of one or more members of a class of soybean seed storage 
protein subunits is reduced when compared to soybeans not containing the 
chimeric gene of step (a). 

2. The method of Claim 1 wherein the soybean seed storage protein is 
20 selected from the group consisting of glycinin and p-conglycinin. 

3. The method of Claim 1 wherein the nucleic acid fragment encoding all 
or a portion of a soybean seed storage protein is placed in sense orientation 
relative to the promoter region. 

4. The method of Claim 1 wherein the nucleic acid fragment encoding all 
25 or a portion of a soybean seed storage protein is placed in antisense orientation 

relative to the promoter region. 

5 . The method of Claim 4 wherein the nucleic acid fragment encodes the 
alpha subunit of the P-conglycinin soybean seed storage protein. 

6. The method of Claim 1 wherein at least two members of a class of 

30 soybean seed storage protein subunits are reduced when compared to soybeans not 
containing the chimeric gene of step (a). 

7. A method for simultaneously reducing the expression of two soybean 
genes comprising: 

(a) constructing a chimeric gene comprising: 
35 (i) a nucleic acid fragment encoding a promoter region 

from a soybean seed storage protein gene; and 
(ii) a nucleic acid fragment encoding all or a portion of a 
soybean protein that is not the soybean seed storage 
protein of (i). said nucleic acid fragment placed in sense 
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or antisense orientation relative to the promoter of (i), 
* and (iii) a transcriptional termination region; 
(b) creating a transgenic soybean seed by introducing into a soybean 
seed the chimeric gene of (a); and 
5 (c) growing the transgenic soybean seeds of step (b) under 

conditions that result in expression of the chimeric gene of 
step (a) 

wherein the quantity of one or more members of a class of soybean seed storage 
protein subunits and the quantity of the protein encoded by the nucleic acid 

1 0 fragment of (a)(ii) is reduced when compared to soybeans not containing the 
chimeric gene of step (a). 

8. The method of Claim 7 wherein the nucleic acid fragment encoding all 
or a portion of a soybean protein that is not the soybean seed storage protein of 
(a)(i) is placed in sense orientation relative to the promoter region. 

5 9. The method of Claim 7 wherein the nucleic acid fragment encoding all 

or a portion of a soybean protein that is not the soybean seed storage protein of 
(a)(i) is placed in antisense orientation relative to the promoter region. 

1 0. The method of Claim 7 wherein the promoter is derived from the gene 
encoding the alpha subunit of the P-conglycinin soybean seed storage protein. 

1 1 . The method of Claim 7 wherein the nucleic acid fragment encoding all 
or a portion of a soybean protein that is not the soybean seed storage protein of 
(a)(i) encodes a gene involved in fatty acid biosynthesis. 

1 2. The method of Claim 7 wherein quantity of one or more members of a 
class of soybean seed storage protein subunits and the quantity of the protein 
encoded by the nucleic acid fragment of (a)(ii) are reduced when compared to 
soybeans not containing the chimeric gene of step (a). 

1 3 . The method of Claim 1 3 wherein at least two members of a class of 
soybean seed storage protein subunits are reduced when compared to soybean 
seeds not containing the chimeric gene of step (a), and wherein the fatty acid 
profile of soybean seeds containing the chimeric gene of step (a) is altered when 
compared to soybean seeds not containing the chimeric gene of step (a). 

14. A transgenic soybean plant prepared by the method of Claim I . 

15. A transgenic soybean plant prepared by the method of Claim 7. 

1 6. Transgenic seeds derived from plants of Claim 14. 

1 7. Transgenic seeds derived from plants of Claim 1 5. 

1 8. A transgenic soybean plant wherein the quantity of one or more 
members of a class of soybean seed storage protein subunits is reduced in the 
seeds of said plant when compared to seeds derived from a non-transgenic 
soybean plant. 
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1 9. Transgenic seeds derived from plants of Claim 18. 

20. A transgenic soybean plant wherein 

(i) the quantity of one or more members of a class of soybean seed 
storage protein subunits is reduced; and 

(ii) the oleic acid content relative to the content of other fatty acids 
is increased in the seeds of said plant when compared to seeds 
derived from a non-transgenic soybean plant 

2 1 . Transgenic seeds derived from plants of Claim 20. 
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Figure 2. 
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Figure 3. 
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Figure 4 
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Figure 5. SDS-PAGE of protein extracted from transformed 
somatic embryos Jot-1 and Jo1-4 
(10% gel, coomassie stained) 
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Figure 6. 
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Figure 7. SDS-PAGE of protein extracted from transformed 
soybean seeds of G94-1 andG94-19 
(10% gel, coomassie stained) 
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Figure 8. 
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Figure 9 



Hindi!) 




9/9 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 
C12N 15/82, 15/29, A01H 5/00 



A3 



(11) Internationa! Publication Number: WO 97/47731 

(43) International Publication Date: 1 8 December 1 997 ( 1 8. 1 2.97) 



(21) International Application Number: PCT/US97/09743 

(22) International Filing Date: 1 0 June 1 997 ( 1 0.06.97) 



(30) Priority Data: 

60/019,940 



14 June 1996(14.06.96) 



US 



(71) Applicant (for ail designated States except US): E.I. DU PONT 

DE NEMOURS AND COMPANY [US/US]; 1007 Market 
Street, Wilmington, DE 19898 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): KINNEY, Anthony, J. 
[GB/US]; 609 Lore Avenue, Wilmington, DE 19809 (US). 
FADER, Gary, Michael [US/US]; 100 Woods Lane, Lan- 
denberg, PA 19350 (US). 

(74) Agent: CHRISTENBURY, Lynne, M.; E.I. du Pont de 
Nemours and Company, Legal Patent Records Center, 1007 
Market Street, Wilmington, DE 19898 (US). 



(81) Designated States: AL, AM, AU, AZ, BA, BB, BG, BR, BY, 
CA, CN, CU, CZ, EE, GE, HU, IL, IS, JP, KG, KP, KR, 
KZ, LC, LK, LR, LT, LV, MD, MG, MK, MN, MX, NO, 
NZ, PL, RO, RU, SG, SI, SK, TJ, TM, TR, TT, UA, US, 
UZ, VN, YU, ARIPO patent (GH, KE, LS, MW, SD, SZ, 
UG), Eurasian patent (AM, AZ. BY, KG, KZ, MD, RU, TJ, 
TM), European patent (AT, BE, CH, DE, DK, ES, FI, FR, 
GB, GR, IE, IT, LU, MC, NL, PT, SE), OAPI patent (BF, 
BJ, CF, CG, CI, CM, GA, GN, ML, MR. NE, SN, TD, TG). 



Published 

fVith international search report. 

(88) Date of publication of the international search report: 

30 September 1999 (30.09.99) 



(54) Title: SUPPRESSION OF SPECIFIC CLASSES OF SOYBEAN SEED PROTEIN GENES 



(57) Abstract 



This invention concerns the construc- 
tion of transgenic soybean lines wherein the 
expression of genes encoding seed storage 
proteins are modulated to effect a change 
in seed storage protein profile of transgenic 
plants. Modification of the seed storage pro- 
tein profile can result in the production of 
novel soy protein products with unique and 
valuable functional characteristics. 




EcoRl 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Ann cnia 


FI 


Fmland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


PR 


fiance 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


SZ 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MO 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


zw 


Zimbabwe 


CI 


C6te d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of* Korea 


PT 


Portugal 






cu 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






cz 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


U 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







INTERNATIONAL SEARCH REPORT 



.ernsc i Application He 

PCT/U3 97/09743 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 5 C12N15/82 C12N15/29 A01H5/00 



Accoraing to international Palam Classification UPC) Of to both national classification and IPC 



3. FIELDS SEARCHED 



Minimum documentation searched {Classification system foUowed by classification symooisj 

IPC 6 C12N A01H 



Documentation searcfieo other than rrwiimum documentation to the extent that sucn documents are included in the fields searcned 



Electronic data oase consulted dunng the international search (name of data base and. wnere practical, search terms useot 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category ' Citation ot document with indication, where appropriate, ot the relevant passages 



Relevant to claim No. 



WO 95 27068 A (PIONEER HI BRED INT) 12 
October 1995 
see page 18 

EP 0 620 281 A (MITSUBISHI) 19 October 
1994 

see the whole document 

-/-- 



1,14,16, 
18 



1 

20 



Further documents are listed in the continuation ot box C. 



Patent family members are listed m annex. 



Special categonas or ctt-rO documents : 

A' document oefinmg the general state ot the art which is not 
consKterea :o oe ot particular relevance 

E ' earlier document but Published on or after the international 
filing date 

L 1 document wmcn may Tirow deuDta on priority cfaimfsl or 
wnicn is ated to establish the publication date ot another 
citation or :thor specal reason ias specified) 
O* document refernng tc an oral aisclosure. use. exhibition or 
other means 

'P* document ouetished r*>or to tne international riling date but 
later man :ro pnonr> sate ciaimeo 



T" later document published after me international tiling date 
or priority date and not m conflict with the application but 
cited to understand the prince w or theory underlying the 
invention 

"X" document of particular relevance: the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken atone 

"Y" document or particular relevance: the claimed invention 

cannot be considered to irrvorve an inventive step when the 
document is comomed wrtn one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

"V document member :t the same patent family 



Data of the actual comptetcn of ihemternational search 



5 November 1997 



Oate ol mailing ot me international searcn rwort 

25/11/1997 



Name ana matfirg address of me iSa 

Eurooear =atent Cmce. P S. 5at8 Palentlaan2 
N IL - 22 z ' hv Piiiswiik 
■*i. ( -3'-~3) 340-2040. Tx. 31 65t epo nl. 
= ax ,-3--~0) 3JO-3018 



Aulhonzed officer 



Maddox. A 



page 1 of 3 



INTERNATIONAL SEARCH REPORT 



.t«rnati ' Application Mo 

PCT/US 97/09743 



C.fContinuattom DOCUMENTS CONSIDERED TO 8E RELEVANT 



;ai«qory CtMiion ol Jocum*nt. win indication. where aopropnate. n th« relevant passages 



Relevant to claim No. 



CHEMICAL ABSTRACTS, vol. 111. no. 23, 
1989 

Columbus, Ohio, US; 
abstract no. 212128, 

OGAWA. T., ET AL. : "Genetic Improvement 

of seed storage proteins using three 

variant alleles of 7S globulin subunits in 

soybean (Glycine max L.)" 

XP002045645 

see abstract 

& 

IKASHUGAKU ZASSHI , 
vol. 39, no. 2, 1989, 
pages 137-147, 

BIOLOGICAL ABSTRACTS, vol. 98, 
1994 

Philadelphia, PA, US; 
abstract no. 83825, 

TAKAHASHI, K . , ET AL.: "An induced mutant 

line lacking the alpha-subuni t of 

beta-conglycinln soybean(Glyc1ne max(L.) 

Merrill)" 

XP002045643 

see abstract 

& 

BREEDING SCIENCE, 
vol. 44, no. 1, 1994, 
pages 65-66, 

BIOLOGICAL ABSTRACTS, vol. 101, 

Philadelphia, PA, US; 

abstract no. 137658, 

YAGASAKI, K. , ET AL. : "Inheritance of 

glycinin subunits and characterization of 

glycinln molecules lacking the subunits in 

soybean (Glycine max (L.) Merr.)" 

XP002045644 

see abstract 

& 

BREEDING SCIENCE, 
vol. 46, no. 1, 1996, 
pages 11-15, 

BEACHY R N ET AL: "ACCUMULATION AND 
ASSEMBLY OF SOYBEAN BETA-CONGLYCININ IN 
SEEDS OF TRANSFORMED PETUNIA PLANTS" 
EMBO JOURNAL , 
vol. 4, 1985, 

pages 3047-3053, XP002035495 
see the whole document 

EP 0 591 530 A (RICE BREEDING RESEARCH 
LABORATORIES) 13 April 1994 
see tie whole document 

-/-- 



1,2.5 



1,2,5 



1,2,5 



form PCT ISA.2U iccrttnujncr :• Mcond tn#M|iJuy 1992} 



page 2 of 3 



INTERNATIONAL SEARCH REPORT 



•mar ' Application 

PCT/US 97/09743 



(^Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 


Category 


Citation 01 Jeouxrwnt. with indtcjtion.wnaf* appropriate. 01 tn« reMvant passages 


R4Uvant to claim No. 


A 


UTSUMI S ET AL: "SYNTHESIS. PROCESSING 

AND ACCUMULATION OF MODIFIED GLYCININS OF 

SOYBEAN IN THE SEEDS, LEAVES AND STEMS OF 

TRANSGENIC TOBACCO" 

PLANT SCIENCE, 

vol. 92, no. 2. 1993, 

pages 191-202, XP002035496 

see abstract 


2 


A 


DATABASE WPI 

Section Ch, Week 9030 

Derwent Publications Ltd., London, GB; 

Class 016, AN 90-228488 

XP002045646 

& JP 02 156 889 A (KIRIN BREWERY KK) , 15 
June 1990 
see abstract 


2 


A 


PARK. Y.-D., ET AL. : "Gene silencing 

mediated by promoter homology occurs at 

the level of transcription and results in 

meiotically heritable alterations in 

methyl at ion and gene activity" 

THE PLANT JOURNAL, 

vol. 9, no. 2, February 1996, 

pages 183-194, XP002045640 

see the whole document 


7 


A 


BRUSSLAN, J. A., ET AL. : "Isolation of new 

promoter-mediated co-suppressed lines in 

Arabidopsi s thai iana" 

PLANT MOLECULAR BIOLOGY, 

vol. 27, 1995, 

pages 809-813, XP002045641 

cited in the application 

see the whole document 


7 


A 


HEPPARD, E.P., ET AL.: "Development and 

growth temperature regulation of two 

different microsomal omega-6 desaturase 

genes in soybeans" 

PLANT PHYSIOLOGY, 

vol. 110, no. 1, January 1996, 

pages 311-319, XP002045642 

see page 317, left-hand column 


11,20 


A 


WO 94 11516 A (DU PONT ; LIGHTNER JONATHAN 
EDWARD (US); OKULEY JOHN JOSEPH (US)) 26 
May 1994 

see page 83 - page 98 


11,20 



Form f*CT ISA 2 10 ■ :-"nnuj!H:r " wcorw sn*«H uuy 19921 



INTERNATIONAL SEARCH REPORT 

inr. aalion on patent family memtw-i i 



PCT/US 97/09743 



i Patent document 


Publication 




Patent family 


Publication 


j cued m searcn report 


date 




member(S) 


date 


WO 9527068 A 


12-10-95 


AU 


2273795 A 


23-10-95 






EP 


0754236 A 


22-01-97 



EP 620281 A 19-10-94 



EP 591530 A 13-04-94 



WO 9411516 A 26-05-94 



CA 2120065 A 01-10-94 

CN 1105704 A 26-07-95 

JP 6335334 A 06-12-94 

US 5530192 A 25-06-96 



WO 9318643 A 30-09-93 
AU 1423992 A 21-10-93 
US 5516668 A 14-05-96 



AU 5407594 A 08-06-94 

CA 2149223 A 26-05-94 

EP 0668919 A 30-08-95 

JP 8503364 T 16-04-96 



Foim PCT1SA/2IO ipjrtnl tamn umui iJuty 1992) 



